SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC ACIDS ENCODING THE SAME

FIELD OF THE INVENTION 
The present invention relates generally to the identification and isolation of novel DNA and to the 
recombinant production of novel polypeptides.

BACKGROUND OF THE INVENTION 
Extracellular proteins play important roles in, among other things, the formation, differentiation and 
maintenance of multicellular organisms. The fate of many individual cells, e.g., proliferation, migration,
differentiation, or interaction with other cells, is typically governed by information received from other cells
and/or the immediate environment. This information is often transmitted by secreted polypeptides (for
instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and
hormones) which are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins.
These secreted polypeptides or signaling molecules normally pass through the cellular secretory pathway to
reach their site of action in the extracellular environment.

Secreted proteins have various industrial applications, including as pharmaceuticals, diagnostics, 
biosensors and bioreactors. Most protein drugs available at present, such as thrombolytic agents, interferons, 
interleukins, erythropoietins, colony stimulating factors, and various other cytokines, are secretory proteins. 
Their receptors, which are membrane proteins, also have potential as therapeutic or diagnostic agents. Efforts
are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts
are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for
novel secreted proteins. Examples of screening methods and techniques are described in the literature [see,
for example, Klein et al., Proc. Natl. Acad. Sci. 93:7108-7113 (1996); U.S. Patent No. 5,536,637].

Membrane-bound proteins and receptors can play important roles in, among other things, the
formation, differentiation and maintenance of multicellular organisms. The fate of many individual cells, e.g.,
proliferation, migration, differentiation, or interaction with other cells, is typically governed by information
received from other cells and/or the immediate environment. This information is often transmitted by secreted
polypeptides (for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors,
neuropeptides, and hormones) which are, in turn, received and interpreted by diverse cell receptors or
membrane-bound proteins. Such membrane-bound proteins and cell receptors include, but are not limited to,
cytokine receptors, receptor kinases, receptor phosphatases, receptors involved in cell-cell interactions, and
cellular adhesin molecules like selectins and integrins. For instance, transduction of signals that regulate cell
growth and differentiation is regulated in part by phosphorylation of various cellular proteins. Protein tyrosine
kinases, enzymes that catalyze that process, can also act as growth factor receptors. Examples include
fibroblast growth factor receptor and nerve growth factor receptor.

Membrane-bound proteins and receptor molecules have various industrial applications, including as 
pharmaceutical and diagnostic agents. Receptor immunoadhesins, for instance, can be employed as therapeutic 
agents to block receptor-ligand interactions. The membrane-bound proteins can also be employed for screening
of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction.

Efforts are being undertaken by both industry and academia to identify new, native receptor or
membrane-bound proteins. Many efforts are focused on the screening of mammalian recombinant DNA
libraries to identify the coding sequences for novel receptor or membrane-bound proteins.

1. PRO281

A novel gene designated testis enhanced gene transcript (TEGT) has recently been identified in humans
(Walter et al., Genomics 20:301-304 (1995)). Recent results have shown that TEGT protein is
developmentally regulated in the mammalian testis and possesses a nuclear targeting motif that allows the
protein to localize to the nucleus (Walter et al., Mamm. Genome 5:216-221 (1994)). As such, it is believed
that the TEGT protein plays an important role in testis development. There is, therefore, substantial interest
in identifying and characterizing novel polypeptides having homology to the TEGT protein. We herein
describe the identification and characterization of novel polypeptides having homology to TEGT protein,
designated herein as PRO281 polypeptides.

2. PRO276

Efforts are being undertaken by both industry and academia to identify new, native membrane-bound
proteins. Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to
identify the coding sequences for novel membrane-bound proteins. We herein describe the identification and
characterization of novel transmembrane polypeptides, designated herein as PRO276 polypeptides.

3. PRO189

Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the
coding sequences for novel secreted proteins. We herein describe the identification and characterization of
novel secreted polypeptides, designated herein as PRO189 polypeptides.

4. PRO190 

Of particular interest are proteins having seven transmembrane domains (7TM), or more generally, 
all multiple transmembrane spanning proteins. Among multiple transmembrane spanning proteins are ion 
channels and transporters. Examples of transporters are the UDP-galactose transporter described in Ishida,
et al., J. Biochem., 120(6):1074-1078 (1996), and the CMP-sialic acid transporter described in Eckhardt, et
al., PNAS, 93(15):7572-7576 (1996). We herein describe the identification and characterization of novel
transmembrane polypeptides, designated herein as PRO190 polypeptides.
therapy. We herein describe the identification and characterization of novel polypeptides having homology
to the L6 cell surface tumor cell-associated antigen, designated herein as PRO994 polypeptides.

74. PRO812

Steroid binding proteins play important roles in numerous physiological processes associated with
steroid function. Specifically, one steroid binding protein-associated polypeptide that has been well
characterized is component 1 of the prostatic binding protein. Component 1 of the prostatic binding protein
has been shown to be specific for subunit F of the prostatic binding protein, the major secretory glycoprotein
of the rat ventral prostate (Peeters et al., Eur. J. Biochem. 123:55-62 (1982) and Liao et al., J. Biol. Chem.
257:122-125 (1982)). The amino acid sequence of component 1 of the prostatic binding protein has been
determined, wherein the sequence is highly rich in glutamic acid residues and is overall highly acidic. This
protein plays an important role in the response of the prostate gland to steroid hormones. We herein describe
the identification and characterization of novel polypeptides having homology to prostatic steroid-binding
protein c1, designated herein as PRO812 polypeptides.

75. PRO1069

Of particular interest is the identification of new membrane-bound proteins involved in ion
conductance such as channel inhibitory factor (CHIP) and MAT-8, which have recently been reported (see
Wald et al., Am. J. Physiol. 272(5 pt 2):F617-F623 (1997); Capurro et al., Am. J. Physiol. 271(3 pt 1):
C753-C762 (1996); Wald et al., Am. J. Physiol. 271(2 pt 2):F322-F329 (1996); and Morrison et al., J. Biol.
Chem. 270(5):2176-2182 (1995)).

Described herein is the identification and characterization of novel polypeptides having homology to
CHIP and MAT-8 polypeptides, designated herein as PRO1069 polypeptides.

76. PRO1129

Cytochromes P-450 are a superfamily of hemoproteins which represent the main pathway for drug
and chemical oxidation (Horsmans, Acta Gastroenterol. Belg. 60:2-10 (1997)). This superfamily is divided
into families, subfamilies and/or single enzymes. Recent reports have provided a great deal of information
concerning the cytochrome P-450 isozymes and increased awareness of life threatening interactions with such
commonly prescribed drugs as cisapride and some antihistamines (Michalets, Pharmacotherapy 18:84-112
(1998) and Singer et al., J. Am. Acad. Dermatol. 37:765-771 (1997)). Given this information, there is
significant interest in identifying novel members of the cytochrome P-450 family of proteins. We herein
describe the identification and characterization of novel polypeptides having homology to cytochrome P-450
proteins, designated herein as PRO1129 polypeptides.

77. PRO1068

Urotensins are neurosecretory proteins that are of interest because of their potential roles in a variety 
of physiological processes including smooth muscle contraction (Yano et al. Gen. Comp. Endocrinol. 96(3): 
135. PRO1384

One class of receptor proteins that has been of interest is the NKG2 family of type II transmembrane 
molecules that are expressed in natural killer cells. These proteins, which have been shown to be covalently 
associated with CD94, are involved in natural killer cell-mediated recognition of different HLA-allotypes 
(Plougastel, B. et al., Eur. J. Immunol. (1997) 27(11):2835-2839), and interact with major histocompatibility
complex (MHC) class I to either inhibit or activate functional activity (Ho, EL. et al., Proc. Natl. Acad. Sci.
(1998) 95(11):6320-6325). Accordingly, the identification and characterization of new members of this family
of receptor proteins is of interest (see Houchins JP, et al., J. Exp. Med. (1991) 173(4):1017-1020).

SUMMARY OF THE INVENTION 

1. PRO281

A cDNA clone (DNA16422-1209) has been identified, having homology to nucleic acid encoding
testis enhanced gene transcript (TEGT) protein that encodes a novel polypeptide, designated in the present
application as "PRO281".

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA
encoding a PRO281 polypeptide.

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO281 polypeptide having
the sequence of amino acid residues from about 1 or about 15 to about 345, inclusive of Figure 2 (SEQ ID
NO:2), or (b) the complement of the DNA molecule of (a).

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO281
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 80
or about 122 and about 1114, inclusive, of Figure 1 (SEQ ID NO:1). Preferably, hybridization occurs under
stringent hybridization and wash conditions.

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209929
(DNA16422-1209) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA
in ATCC Deposit No. 209929 (DNA16422-1209).

In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence
identity to the sequence of amino acid residues 1 or about 15 to about 345, inclusive of Figure 2 (SEQ ID
NO:2), or (b) the complement of the DNA of (a).
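The tiered identity thresholds recited above (80%, 85%, 90%, 95%) can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned by some external tool and that percent identity is counted over aligned columns, gaps included in the denominator; that convention, the function names and the tier labels are illustrative assumptions and are not the definition of sequence identity used in this application.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Ignore columns where both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no usable columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Thresholds taken from the text; the labels are hypothetical shorthand.
TIERS = [
    (95.0, "most preferably"),
    (90.0, "more preferably"),
    (85.0, "preferably"),
    (80.0, "at least"),
]


def identity_tier(pct: float):
    """Return the highest tier met by pct, or None if below 80%."""
    for threshold, label in TIERS:
        if pct >= threshold:
            return label
    return None


if __name__ == "__main__":
    pct = percent_identity("ACGTACGTAC", "ACGTACGTAA")
    print(pct, identity_tier(pct))
```

The word "about" in the recited thresholds is not modeled here; a real comparison would need whatever tolerance the governing definition allows.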
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PRO994 polypeptide having the
sequence of amino acid residues from about 1 to about 229, inclusive of Figure 176 (SEQ ID NO:258), or (b)
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell
comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii)
recovering the polypeptide from the cell culture.

In yet another embodiment, the invention concerns agonists and antagonists of a native PRO994
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PRO994 antibody.

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of
a native PRO994 polypeptide by contacting the native PRO994 polypeptide with a candidate molecule and
monitoring a biological activity mediated by said polypeptide.

In a still further embodiment, the invention concerns a composition comprising a PRO994 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier.

74. PRO812

A cDNA clone (DNA59205-1421) has been identified, having homology to nucleic acid encoding
prostatic steroid-binding protein c1 that encodes a novel polypeptide, designated in the present application as
"PRO812".

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA
encoding a PRO812 polypeptide.

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO812 polypeptide having
the sequence of amino acid residues from about 1 or about 16 to about 83, inclusive of Figure 178 (SEQ ID
NO:260), or (b) the complement of the DNA molecule of (a).

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO812
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 55
or about 100 and about 303, inclusive, of Figure 177 (SEQ ID NO:259). Preferably, hybridization occurs
under stringent hybridization and wash conditions.

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203009
(DNA59205-1421) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA
in ATCC Deposit No. 203009 (DNA59205-1421).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence
identity to the sequence of amino acid residues 1 or about 16 to about 83, inclusive of Figure 178 (SEQ ID
NO:260), or (b) the complement of the DNA of (a).

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 
molecule encoding a PRO812 polypeptide having the sequence of amino acid residues from 1 or about 16 to
about 83, inclusive of Figure 178 (SEQ ID NO:260), or (b) the complement of the DNA molecule of (a), and,
if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b), isolating the test DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA 
encoding a PRO812 polypeptide, with or without the N-terminal signal sequence and/or the initiating
methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been 
tentatively identified as extending from about amino acid position 1 to about amino acid position 15 in the 
sequence of Figure 178 (SEQ ID NO:260). 
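The relationship between the precursor (residues 1 to 83) and the mature form (about residue 16 to 83) can be sketched as a simple slice at the tentative signal peptide boundary. The sequence below is a made-up placeholder of the right length, not SEQ ID NO:260, and the function name and default boundary are assumptions for illustration; the boundary itself is described above only as tentative.

```python
def split_signal_peptide(precursor: str, signal_length: int = 15):
    """Split a precursor into (signal peptide, mature polypeptide).

    signal_length is the number of N-terminal residues treated as signal
    peptide; residues are 1-based in the text, so a signal peptide spanning
    positions 1-15 leaves a mature chain starting at position 16.
    """
    if not 0 <= signal_length <= len(precursor):
        raise ValueError("signal_length must lie within the precursor")
    return precursor[:signal_length], precursor[signal_length:]


if __name__ == "__main__":
    # Hypothetical 83-residue placeholder, NOT the sequence of Figure 178.
    placeholder = "M" + "A" * 14 + "G" * 68
    signal, mature = split_signal_peptide(placeholder)
    print(len(signal), len(mature))
```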

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or about 16 to about 83, inclusive of Figure 178 (SEQ ID NO:260), or (b) 
the complement of the DNA of (a). 

Another embodiment is directed to fragments of a PRO812 polypeptide coding sequence that may find
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length,
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50
nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived
from the nucleotide sequence shown in Figure 177 (SEQ ID NO:259).
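As a rough illustration of how probe-sized fragments might be enumerated from a coding sequence, the sketch below slides a fixed-length window along a nucleotide string and reports 1-based start positions. The window length is constrained to the 20 to 80 nucleotide range given above; the step size, the function names and the placeholder sequence are assumptions, and no claim is made that probes were actually chosen this way.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written with A, C, G, T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def probe_windows(seq: str, length: int = 40, step: int = 10):
    """Enumerate candidate probe fragments of a fixed length.

    Returns a list of (1-based start position, fragment) tuples.
    """
    if not 20 <= length <= 80:
        raise ValueError("probe length should be between 20 and 80 nucleotides")
    if step < 1:
        raise ValueError("step must be positive")
    seq = seq.upper()
    return [
        (start + 1, seq[start:start + length])
        for start in range(0, len(seq) - length + 1, step)
    ]


if __name__ == "__main__":
    placeholder = "ACGT" * 25  # hypothetical 100-nt sequence, not SEQ ID NO:259
    for position, fragment in probe_windows(placeholder, length=40, step=20):
        print(position, fragment, reverse_complement(fragment))
```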

In another embodiment, the invention provides isolated PRO812 polypeptide encoded by any of the
isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native sequence PRO812 polypeptide, which in
certain embodiments, includes an amino acid sequence comprising residues 1 or about 16 to about 83 of Figure 
178 (SEQ ID NO:260). 

In another aspect, the invention concerns an isolated PRO812 polypeptide, comprising an amino acid
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the
sequence of amino acid residues 1 or about 16 to about 83, inclusive of Figure 178 (SEQ ID NO:260).

In a further aspect, the invention concerns an isolated PRO812 polypeptide, comprising an amino acid
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least
about 90% positives, most preferably at least about 95% positives when compared with the amino acid
sequence of residues 1 or about 16 to about 83, inclusive of Figure 178 (SEQ ID NO:260).

In yet another aspect, the invention concerns an isolated PRO812 polypeptide, comprising the
sequence of amino acid residues 1 or about 16 to about 83, inclusive of Figure 178 (SEQ ID NO:260), or a
fragment thereof sufficient to provide a binding site for an anti-PRO812 antibody. Preferably, the PRO812
fragment retains a qualitative biological activity of a native PRO812 polypeptide.

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PRO812 polypeptide having the
sequence of amino acid residues from about 1 or about 16 to about 83, inclusive of Figure 178 (SEQ ID 
NO:260), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about 
a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a 
host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and 
(iii) recovering the polypeptide from the cell culture. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PRO812
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PRO812 antibody.

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of 
a native PRO812 polypeptide by contacting the native PRO812 polypeptide with a candidate molecule and
monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PRO812 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

75. PRO1069 

Applicants have identified a cDNA clone, DNA59211-1450, that encodes a novel polypeptide having
homology to CHIP wherein the polypeptide is designated in the present application as "PRO1069".

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA 
encoding a PRO1069 polypeptide.

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1069 polypeptide 
having the sequence of amino acid residues from 1 or about 17 to about 89, inclusive of Figure 180 (SEQ ID 
NO:262), or (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO1069
polypeptide comprising DNA that hybridizes to the complement of the nucleic acid sequence having about 
residues 197 or about 245 to about 463, inclusive of Figure 179 (SEQ ID NO:261). Preferably, hybridization 
occurs under stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
transmembrane domain at about amino acids 32-57, and potential N-glycosylation sites at about amino acids
68-71, 120-123, and 134-137.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 306 (SEQ ID NO:424), revealed homology
between the PRO1384 amino acid sequence and the following Dayhoff sequences: AF054819_1, HSAJ1687_1,
AF009511_1, AB010710_1, GEN13595, HSAJ673_1, GEN13961, AB005900_1, LECH_CHICK,
AF021349_1, and NK13_RAT.

Clone DNA71159-1617 has been deposited with ATCC and is assigned ATCC deposit no. 203135.

EXAMPLE 139: Use of PRO as a hybridization probe

The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. 

DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed 
as a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants of PRO) in 
human tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a
solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM sodium phosphate, pH
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed
in an aqueous solution of 0.1x SSC and 0.1% SDS at 42°C.
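For readers preparing the wash solution, the following sketch works out stock volumes by the usual C1V1 = C2V2 dilution rule and converts an SSC "fold" into molar concentrations, using the standard composition of 1x SSC (0.15 M NaCl, 0.015 M trisodium citrate). The stock strengths (20x SSC, 10% SDS) and all function names are assumptions chosen for illustration; the text above specifies only the final concentrations.

```python
# Standard composition of 1x SSC, in mol/L.
SSC_1X_MOLAR = {"NaCl": 0.15, "trisodium citrate": 0.015}


def ssc_molarity(fold: float) -> dict:
    """Molar concentrations of the SSC components at a given fold strength."""
    return {name: conc * fold for name, conc in SSC_1X_MOLAR.items()}


def stock_volume(final_conc: float, stock_conc: float, final_volume_ml: float) -> float:
    """Volume of stock needed (C1V1 = C2V2); concentrations share one unit."""
    if stock_conc <= 0 or final_conc < 0 or final_conc > stock_conc:
        raise ValueError("need 0 <= final_conc <= stock_conc and stock_conc > 0")
    return final_conc / stock_conc * final_volume_ml


def wash_buffer(final_volume_ml: float,
                ssc_stock_fold: float = 20.0,      # assumed 20x SSC stock
                sds_stock_percent: float = 10.0):  # assumed 10% SDS stock
    """Volumes for the 0.1x SSC, 0.1% SDS wash solution described in the text."""
    ssc_ml = stock_volume(0.1, ssc_stock_fold, final_volume_ml)
    sds_ml = stock_volume(0.1, sds_stock_percent, final_volume_ml)
    return {
        "ssc_stock_ml": ssc_ml,
        "sds_stock_ml": sds_ml,
        "water_ml": final_volume_ml - ssc_ml - sds_ml,
    }


if __name__ == "__main__":
    print(wash_buffer(1000.0))
    print(ssc_molarity(5))    # hybridization strength
    print(ssc_molarity(0.1))  # wash strength
```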

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO 
can then be identified using standard techniques known in the art. 

EXAMPLE 140: Expression of PRO in E. coli

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in
E. coli.

The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers
should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is
pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for ampicillin and
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR
amplified sequences are then ligated into the vector. The vector will preferably include sequences which
encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons,
polyhis sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator,
and an argU gene.
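The requirement that the primers carry restriction sites matching those of the vector can be sketched as follows. The example prepends a short 5' clamp and a recognition site to the annealing region of each primer and rejects inserts that contain either site internally. The recognition sequences for EcoRI, BamHI, HindIII and XhoI are the standard ones; the choice of enzymes, the "GC" clamp, the 18-nucleotide annealing length, the function names and the placeholder insert are all assumptions, since the text does not name specific sites.

```python
# Standard recognition sequences (all four are palindromic).
SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "XhoI": "CTCGAG",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written with A, C, G, T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_primers(coding: str, fwd_enzyme: str, rev_enzyme: str,
                   anneal_len: int = 18, clamp: str = "GC"):
    """Return (forward, reverse) primers, each written 5' to 3'.

    Layout of each primer: clamp + restriction site + annealing region.
    """
    coding = coding.upper()
    if len(coding) < anneal_len:
        raise ValueError("coding sequence shorter than the annealing region")
    for enzyme in (fwd_enzyme, rev_enzyme):
        # Palindromic sites read the same on both strands, so one scan suffices.
        if SITES[enzyme] in coding:
            raise ValueError(f"{enzyme} site occurs inside the insert")
    forward = clamp + SITES[fwd_enzyme] + coding[:anneal_len]
    reverse = clamp + SITES[rev_enzyme] + reverse_complement(coding[-anneal_len:])
    return forward, reverse


if __name__ == "__main__":
    placeholder = "ATG" + "GCT" * 20 + "TAA"  # hypothetical insert, not a PRO sequence
    fwd, rev = design_primers(placeholder, "EcoRI", "BamHI")
    print(fwd)
    print(rev)
```

Reading-frame placement relative to the polyhis leader is not handled here and would have to be checked against the actual vector.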

The ligation mixture is then used to transform a selected E. coli strain using the methods described 
in Sambrook et al., supra. Transformants are identified by their ability to grow on LB plates and antibiotic
resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and 
DNA sequencing. 
Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are 
then grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the 
solubilized PRO protein can then be purified using a metal chelating column under conditions that allow tight 
binding of the protein. 

PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The
DNA encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction
enzyme sites which correspond to the restriction enzyme sites on the selected expression vector, and other
useful sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation
column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then
ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110
fuhA(tonA) lon galE rpoHts(htpRts) clpP(lacIq)). Transformants are first grown in LB containing 50 mg/ml
carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold
into CRAP media (prepared by mixing 3.57 g (NH4)2SO4, 0.71 g sodium citrate·2H2O, 1.07 g KCl, 5.36 g
Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MPOS, pH 7.3, 0.55%
(w/v) glucose and 7 mM MgSO4) and grown for approximately 20-30 hours at 30°C with shaking. Samples
are removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells.
Cell pellets are frozen until purification and refolding.
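The CRAP media recipe above is stated per 500 mL, so scaling it to another batch size is simple proportion. The sketch below scales only the weighed solids listed in the text and computes the inoculum volume range implied by a 50 to 100 fold dilution, taking the fold relative to the final culture volume. The dictionary keys and function names are hypothetical, and the buffer, glucose and MgSO4 additions are left out because the text gives them as final concentrations rather than masses.

```python
# Masses in grams per 500 mL of water, as listed in the text.
CRAP_SOLIDS_G_PER_500_ML = {
    "(NH4)2SO4": 3.57,
    "sodium citrate dihydrate": 0.71,
    "KCl": 1.07,
    "Difco yeast extract": 5.36,
    "Sheffield hycase SF": 5.36,
}


def scale_crap_solids(volume_ml: float) -> dict:
    """Masses (g) of the weighed solids for a batch of the given volume."""
    if volume_ml <= 0:
        raise ValueError("volume must be positive")
    factor = volume_ml / 500.0
    return {name: grams * factor for name, grams in CRAP_SOLIDS_G_PER_500_ML.items()}


def inoculum_range_ml(culture_volume_ml: float,
                      min_fold: float = 50.0,
                      max_fold: float = 100.0):
    """(smallest, largest) starter-culture volume for a 50-100 fold dilution."""
    return culture_volume_ml / max_fold, culture_volume_ml / min_fold


if __name__ == "__main__":
    for name, grams in scale_crap_solids(1000.0).items():
        print(f"{name}: {grams:.2f} g")
    print(inoculum_range_ml(500.0))
```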

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7
M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate are added to make final
concentrations of 0.1 M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results
in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000
rpm in a Beckman Ultracentrifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The
clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm
using the calculated extinction coefficient based on its amino acid sequence.
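The concentration estimate from absorbance at 280 nm can be sketched with the Beer-Lambert law, A = ε·c·l. The text does not say how the extinction coefficient was calculated; the example below assumes the widely used residue-count approximation (5500 per tryptophan, 1490 per tyrosine and 125 per disulfide-bonded cystine, in M^-1 cm^-1). With the cysteines blocked as described above, the cystine count would be zero. Function names, the example sequence and the molecular weight are hypothetical.

```python
def molar_extinction_280(sequence: str, cystines: int = 0) -> int:
    """Approximate molar extinction coefficient at 280 nm (M^-1 cm^-1).

    Assumed residue-count approximation: 5500 per Trp, 1490 per Tyr,
    125 per cystine (disulfide). Not taken from the source text.
    """
    seq = sequence.upper()
    return 5500 * seq.count("W") + 1490 * seq.count("Y") + 125 * cystines


def concentration_from_a280(a280: float, extinction: float, path_cm: float = 1.0) -> float:
    """Molar concentration from the Beer-Lambert law, c = A / (epsilon * l)."""
    if extinction <= 0 or path_cm <= 0:
        raise ValueError("extinction coefficient and path length must be positive")
    return a280 / (extinction * path_cm)


def molar_to_mg_per_ml(molar: float, molecular_weight: float) -> float:
    """mol/L times g/mol gives g/L, which is numerically equal to mg/ml."""
    return molar * molecular_weight


if __name__ == "__main__":
    eps = molar_extinction_280("MWWYC")          # hypothetical peptide
    conc = concentration_from_a280(1.249, eps)   # hypothetical reading
    print(eps, conc, molar_to_mg_per_ml(conc, 10000.0))
```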

The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer
consisting of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM
EDTA. Refolding volumes are chosen so that the final protein concentration is between 50 to 100
micrograms/ml. The refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is
quenched by the addition of TFA to a final concentration of 0.4% (pH of approximately 3). Before further
purification of the protein, the solution is filtered through a 0.22 micron filter and acetonitrile is added to
2-10% final concentration. The refolded protein is chromatographed on a Poros R1/H reversed phase column
using a mobile buffer of 0.1% TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquots of
fractions with A280 absorbance are analyzed on SDS polyacrylamide gels and fractions containing
homogeneous refolded protein are pooled. Generally, the properly refolded species of most proteins are eluted
at the lowest concentrations of acetonitrile since those species are the most compact with their hydrophobic
interiors shielded from interaction with the reversed phase resin. Aggregated species are usually eluted at
higher acetonitrile concentrations. In addition to resolving misfolded forms of proteins from the desired form,
the reversed phase step also removes endotoxin from the samples.
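The refolding volume rule stated above (final protein concentration of 50 to 100 micrograms/ml) and the TFA quench to 0.4% reduce to short calculations, sketched here. The TFA figure assumes the percentage is volume per volume and that neat TFA is added; the text does not state either, so treat that part as an assumption. Function names are hypothetical.

```python
def refolding_volume_range_ml(protein_mg: float,
                              low_ug_per_ml: float = 50.0,
                              high_ug_per_ml: float = 100.0):
    """(minimum, maximum) refolding volume for the stated concentration window."""
    if protein_mg <= 0:
        raise ValueError("protein mass must be positive")
    protein_ug = protein_mg * 1000.0
    return protein_ug / high_ug_per_ml, protein_ug / low_ug_per_ml


def tfa_volume_ml(solution_ml: float, final_percent: float = 0.4) -> float:
    """Volume v of neat TFA such that v / (solution + v) equals final_percent/100.

    Assumes a volume/volume percentage and ignores volume contraction on mixing.
    """
    fraction = final_percent / 100.0
    if not 0 <= fraction < 1:
        raise ValueError("final_percent must be in [0, 100)")
    return solution_ml * fraction / (1.0 - fraction)


if __name__ == "__main__":
    print(refolding_volume_range_ml(10.0))
    print(round(tfa_volume_ml(100.0), 3))
```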

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with
0.14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia)
resins equilibrated in the formulation buffer and sterile filtered.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 141: Expression of PRO in mammalian cells

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant
expression in mammalian cells.

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector.
Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO
DNA using ligation methods such as described in Sambrook et al., supra. The resulting vector is called pRK5-
PRO.

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573)
are grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum
and optionally, nutrient components and/or antibiotics. About 10 μg pRK5-PRO DNA is mixed with about
1 μg DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 31:543 (1982)] and dissolved in 500 μl
of 1 mM Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl2. To this mixture is added, dropwise, 500 μl of 50 mM
HEPES (pH 7.35), 280 mM NaCl, 1.5 mM NaPO4, and a precipitate is allowed to form for 10 minutes at
25°C. The precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at
37°C. The culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The
293 cells are then washed with serum free medium, fresh medium is added and the cells are incubated for about
5 days.
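
By way of illustration only, the composition of the transfection mixture formed when the two 500 μl solutions above are combined can be estimated with simple dilution arithmetic. The sketch below is not part of the described procedure; the function and variable names are hypothetical, and it assumes ideal mixing with additive volumes.

```python
# Illustrative sketch: final concentrations after combining the two
# 500 ul solutions described above (assumes volumes are additive).

def mixed_concentration(stock_conc, stock_volume, total_volume):
    """Concentration after dilution into total_volume (C1*V1 = C2*V2)."""
    return stock_conc * stock_volume / total_volume

# Volumes from the text, in microliters.
DNA_SOLUTION_UL = 500.0
HEPES_SOLUTION_UL = 500.0
TOTAL_UL = DNA_SOLUTION_UL + HEPES_SOLUTION_UL

# Component concentrations from the text, in mM.
DNA_SOLUTION_MM = {"Tris-HCl": 1.0, "EDTA": 0.1, "CaCl2": 227.0}
HEPES_SOLUTION_MM = {"HEPES": 50.0, "NaCl": 280.0, "NaPO4": 1.5}

def final_mixture():
    """Return the final mM concentration of every component in the mixture."""
    final = {}
    for name, conc in DNA_SOLUTION_MM.items():
        final[name] = mixed_concentration(conc, DNA_SOLUTION_UL, TOTAL_UL)
    for name, conc in HEPES_SOLUTION_MM.items():
        final[name] = mixed_concentration(conc, HEPES_SOLUTION_UL, TOTAL_UL)
    return final

if __name__ == "__main__":
    for component, conc in final_mixture().items():
        print(f"{component}: {conc:g} mM")
```

Because the two solutions are combined in equal volumes, every component ends up at half its starting concentration in the mixture.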

Approximately 24 hours after the transfections, the culture medium is removed and replaced with
culture medium (alone) or culture medium containing 200 μCi/ml 35S-cysteine and 200 μCi/ml 35S-methionine.
After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto
a 15% SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal
the presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation
(in serum free medium) and the medium is tested in selected bioassays.

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate
method described by Sompayrac et al., Proc. Natl. Acad. Sci., 12:7575 (1981). 293 cells are grown to
maximal density in a spinner flask and 700 μg pRK5-PRO DNA is added. The cells are first concentrated from
the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the
cell pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 μg/ml bovine insulin and
0.1 μg/ml bovine transferrin. After about four days, the conditioned medium is centrifuged and filtered to
remove cells and debris. The sample containing expressed PRO can then be concentrated and purified by any
selected method, such as dialysis and/or column chromatography.

In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into
CHO cells using known reagents such as CaPO4 or DEAE-dextran. As described above, the cell cultures can
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such
as 35S-methionine. After determining the presence of PRO polypeptide, the culture medium may be replaced
with serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned
medium is harvested. The medium containing the expressed PRO can then be concentrated and purified by
any selected method.

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of
the pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as
a poly-his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned
into a SV40 driven vector containing a selection marker such as DHFR for selection of stable clones. Finally,
the CHO cells can be transfected (as described above) with the SV40 driven vector. Labeling may be
performed, as described above, to verify expression. The culture medium containing the expressed poly-His
tagged PRO can then be concentrated and purified by any selected method, such as by Ni2+-chelate affinity
chromatography.

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO 
cells by another stable expression procedure.

Stable expression in CHO cells is performed using the following procedure. The proteins are
expressed as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g.
extracellular domains) of the respective proteins are fused to an IgG1 constant region sequence containing the
hinge, CH2 and CH3 domains and/or is a poly-His tagged form.

Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using
standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology, Unit 3.16, John
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and
3' of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used for expression in CHO
cells is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779) (1996), and uses the SV40 early
promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR
expression permits selection for stable maintenance of the plasmid following transfection.

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO
cells using commercially available transfection reagents Superfect® (Qiagen), Dosper® or Fugene® (Boehringer
Mannheim). The cells are grown as described in Lucas et al., supra. Approximately 3 x 10^7 cells are frozen
in an ampule for further growth and production as described below.

The ampules containing the plasmid DNA are thawed by placement into a water bath and mixed by
vortexing. The contents are pipetted into a centrifuge tube containing 10 mL of media and centrifuged at 1000
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2
μm filtered PS20 with 5% 0.2 μm diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL
spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner
filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL
and 2000 mL spinners are seeded with 3 x 10^5 cells/mL. The cell media is exchanged with fresh media by
centrifugation and resuspension in production medium. Although any suitable CHO media may be employed,
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used.
A 3L production spinner is seeded at 1.2 x 10^6 cells/mL. On day 0, the cell number and pH are determined. On
day 1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled,
the temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35%
polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) are added. Throughout the
production, the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability
drops below 70%, the cell culture is harvested by centrifugation and filtering through a 0.22 μm filter. The
filtrate is either stored at 4°C or immediately loaded onto columns for purification.

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen).
Before purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned
media is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3
M NaCl and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with
additional equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole.
The highly purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M
NaCl and 4% mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C.
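
As a hedged illustration of the imidazole addition step above, the volume of a concentrated imidazole stock needed to bring a batch of conditioned media to 5 mM can be computed as sketched below. The 1 M stock concentration and the 1000 ml batch volume are assumptions chosen for the example and are not specified in the text; the function name is hypothetical.

```python
# Illustrative sketch: volume of imidazole stock to add so that the
# conditioned media reaches a target concentration. The stock strength
# and media volume used in the example are assumed values.

def spike_volume(target_conc, stock_conc, sample_volume):
    """Volume of stock to add to sample_volume to reach target_conc.

    Solves stock_conc * v = target_conc * (sample_volume + v) for v.
    Concentrations share one unit (e.g. mM); the result is in the
    same unit as sample_volume.
    """
    if stock_conc <= target_conc:
        raise ValueError("stock must be more concentrated than the target")
    return target_conc * sample_volume / (stock_conc - target_conc)

if __name__ == "__main__":
    media_ml = 1000.0   # assumed batch volume
    stock_mM = 1000.0   # assumed 1 M imidazole stock
    volume = spike_volume(5.0, stock_mM, media_ml)
    print(f"Add {volume:.2f} ml of stock to {media_ml:.0f} ml of media")
```

The formula accounts for the volume increase caused by the addition itself, which matters little at 5 mM but becomes significant for higher targets.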

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20
mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting
1 ml fractions into tubes containing 275 μL of 1 M Tris buffer, pH 9. The highly purified protein is
subsequently desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity
is assessed by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 142: Expression of PRO in Yeast

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from
the ADH2/GAPDH promoter. DNA encoding PRO and the promoter is inserted into suitable restriction
enzyme sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding
PRO can be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a
native PRO signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase
secretory signal/leader sequence, and linker sequences (if needed) for expression of PRO.

Yeast cells, such as yeast strain AB110, can then be transformed with the expression plasmids
described above and cultured in selected fermentation media. The transformed yeast supernatants can be
analyzed by precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining
of the gels with Coomassie Blue stain.

Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may further be purified using selected column chromatography resins.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above.



EXAMPLE 143: Expression of PRO in Baculovirus-Infected Insect Cells

The following method describes recombinant expression of PRO in Baculovirus-infected insect cells.

The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG).
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such
as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence
of PRO, such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence
encoding the mature protein if the protein is extracellular, is amplified by PCR with primers complementary
to the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The
product is then digested with those selected restriction enzymes and subcloned into the expression vector.

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus
DNA (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin (commercially
available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and
used for further amplifications. Viral infection and protein expression are performed as described by O'Reilley
et al., Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford University Press (1994).

Expressed poly-his tagged PRO can then be purified, for example, by Ni2+-chelate affinity
chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by
Rupert et al., Nature, 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer
(25 mM Hepes, pH 7.9; 12.5 mM MgCl2; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KCl), and
sonicated twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is
diluted 50-fold in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered
through a 0.45 μm filter. A Ni2+-NTA agarose column (commercially available from Qiagen) is prepared with
a bed volume of 5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The
filtered cell extract is loaded onto the column at 0.5 mL per minute. The column is washed to baseline A280
with loading buffer, at which point fraction collection is started. Next, the column is washed with a secondary
wash buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound
protein. After reaching A280 baseline again, the column is developed with a 0 to 500 mM imidazole gradient
in the secondary wash buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining
or Western blot with Ni2+-NTA conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted
His10-tagged PRO are pooled and dialyzed against loading buffer.
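
For illustration, the nominal imidazole concentration of each 1 mL fraction across the 0 to 500 mM gradient can be estimated as sketched below. The sketch assumes a linear gradient and a total gradient volume of 50 mL; neither the gradient shape nor its volume is stated in the text, and the function names are hypothetical.

```python
# Illustrative sketch: nominal imidazole concentration along a linear
# 0-500 mM gradient collected in 1 mL fractions. The linear shape and
# the 50 mL gradient volume in the example are assumptions.

def imidazole_at(volume_ml, gradient_volume_ml, start_mM=0.0, end_mM=500.0):
    """Nominal imidazole concentration (mM) after volume_ml of gradient."""
    if not 0.0 <= volume_ml <= gradient_volume_ml:
        raise ValueError("volume must lie within the gradient")
    return start_mM + (end_mM - start_mM) * volume_ml / gradient_volume_ml

def fraction_midpoints(gradient_volume_ml, fraction_ml=1.0):
    """Concentration at the midpoint of each collected fraction."""
    count = int(gradient_volume_ml // fraction_ml)
    return [
        imidazole_at((index + 0.5) * fraction_ml, gradient_volume_ml)
        for index in range(count)
    ]

if __name__ == "__main__":
    for number, conc in enumerate(fraction_midpoints(50.0), start=1):
        print(f"fraction {number}: ~{conc:.0f} mM imidazole")
```

Such a table is only a guide to which fractions to examine by SDS-PAGE; actual elution positions depend on the column and the protein.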

Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known
chromatography techniques, including for instance, Protein A or protein G column chromatography.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 144: Preparation of Antibodies that Bind PRO

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 

Techniques for producing the monoclonal antibodies are known in the art and are described, for
instance, in Goding, supra. Immunogens that may be employed include purified PRO, fusion proteins
containing PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can
be made by the skilled artisan without undue experimentation.

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively,
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with
additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also
be boosted with additional immunization injections. Serum samples may be periodically obtained from the
mice by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies.

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells
are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma
cell line such as P3X63AgU.1, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids.

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of 
"positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the 
art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce
ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in
tissue culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively,
affinity chromatography based upon binding of antibody to protein A or protein G can be employed.

EXAMPLE 145: Purification of PRO Polypeptides Using Specific Antibodies

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the
art of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO
polypeptide is purified by immunoaffinity chromatography using antibodies specific for the PRO polypeptide
of interest. In general, an immunoaffinity column is constructed by covalently coupling the anti-PRO
polypeptide antibody to an activated chromatographic resin.

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium
sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.).
Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or
chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a
chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody
is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's
instructions.

Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a
fraction from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization
of the whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent
or by other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal
sequence may be secreted in useful quantity into the medium in which the cells are grown.

A soluble PRO polypeptide-containing preparation is passed over the immunoaffinity column, and the
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high ionic
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt
antibody/PRO polypeptide binding (e.g., a low pH buffer such as approximately pH 2-3, or a high
concentration of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected.



EXAMPLE 146: Drug Screening

This invention is particularly useful for screening compounds by using PRO polypeptides or binding
fragments thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
be used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can
affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent
with a PRO polypeptide or fragment thereof and assaying (i) for the presence of a complex between the agent
and the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or
fragment and the cell, by methods well known in the art. In such competitive binding assays, the PRO
polypeptide or fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is
separated from that present in bound form, and the amount of free or uncomplexed label is a measure of the
ability of the particular agent to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell
complex.

Another technique for drug screening provides high throughput screening for compounds having
suitable binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September
13, 1984. Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid
substrate, such as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test
compounds are reacted with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods
well known in the art. Purified PRO polypeptide can also be coated directly onto plates for use in the
aforementioned drug screening techniques. In addition, non-neutralizing antibodies can be used to capture the
peptide and immobilize it on the solid support.

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with PRO polypeptide. 


EXAMPLE 147: Rational Drug Design

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of
interest (i.e., a PRO polypeptide) or of small molecules with which they interact, e.g., agonists, antagonists,
or inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of
the PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f.,
Hodgson, Bio/Technology, 9:19-21 (1991)).

In one approach, the three-dimensional structure of the PRO polypeptide, or of a PRO
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most
typically, by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must
be ascertained to elucidate the structure and to determine active site(s) of the molecule. Less often, useful
information regarding the structure of the PRO polypeptide may be gained by modeling based on the structure
of homologous proteins. In both cases, relevant structural information is used to design analogous PRO
polypeptide-like molecules or to identify efficient inhibitors. Useful examples of rational drug design may
include molecules which have improved activity or stability as shown by Braxton and Wells, Biochemistry,
31:7796-7801 (1992) or which act as inhibitors, agonists, or antagonists of native peptides as shown by
Athauda et al., J. Biochem., 113:742-746 (1993).

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above,
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent
drug design can be based. It is possible to bypass protein crystallography altogether by generating
anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a
mirror image, the binding site of the anti-ids would be expected to be an analog of the original receptor. The
anti-id could then be used to identify and isolate peptides from banks of chemically or biologically produced
peptides. The isolated peptides would then act as the pharmacore.

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available
to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques
in place of or in addition to x-ray crystallography.



EXAMPLE 148: Stimulation of Heart Neonatal Hypertrophy (Assay 1)

This assay is designed to measure the ability of PRO polypeptides to stimulate hypertrophy of neonatal 
heart. PRO polypeptides testing positive in this assay are expected to be useful for the therapeutic treatment
of various cardiac insufficiency disorders. 

Cardiac myocytes from 1-day old Harlan Sprague Dawley rats were obtained. Cells (180 μl at 7.5 x
10^4/ml, serum <0.1%, freshly isolated) are added on day 1 to 96-well plates previously coated with
DMEM/F12 + 4% FCS. Test samples containing the test PRO polypeptide or growth medium only (negative
control) (20 μl/well) are added directly to the wells on day 1. PGF (20 μl/well) is then added on day 2 at final
concentration of 10^-6 M. The cells are then stained on day 4 and visually scored on day 5, wherein cells
showing no increase in size as compared to negative controls are scored 0.0, cells showing a small to moderate
increase in size as compared to negative controls are scored 1.0 and cells showing a large increase in size as
compared to negative controls are scored 2.0. A positive result in the assay is a score of 1.0 or greater.
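
The scoring rule above can be expressed, purely as an illustrative sketch, in a few lines of code. The category labels and function names below are hypothetical; only the score values (0.0, 1.0, 2.0) and the positivity threshold (1.0 or greater) are taken from the text.

```python
# Illustrative sketch of the visual scoring rule described above.
# Category names are hypothetical labels for the three observations.

SCORES = {
    "none": 0.0,               # no increase in size vs. negative control
    "small_to_moderate": 1.0,  # small to moderate increase in size
    "large": 2.0,              # large increase in size
}

POSITIVE_THRESHOLD = 1.0  # a score of 1.0 or greater is a positive result

def score_well(size_change):
    """Map an observed size change category to its assay score."""
    return SCORES[size_change]

def is_positive(score):
    """Return True when the score counts as a positive result."""
    return score >= POSITIVE_THRESHOLD

if __name__ == "__main__":
    for observation in SCORES:
        score = score_well(observation)
        print(observation, score, "positive" if is_positive(score) else "negative")
```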

The following polypeptides tested positive in this assay: PRO1312.

EXAMPLE 149: Stimulation of Endothelial Cell Proliferation (Assay 8)

This assay is designed to determine whether PRO polypeptides of the present invention show the
ability to stimulate adrenal cortical capillary endothelial cell (ACE) growth. PRO polypeptides testing positive 
in this assay would be expected to be useful for the therapeutic treatment of conditions or disorders where 
angiogenesis would be beneficial including, for example, wound healing, and the like (as would agonists of 
these PRO polypeptides). Antagonists of the PRO polypeptides testing positive in this assay would be expected 
to be useful for the therapeutic treatment of cancerous tumors. 

Bovine adrenal cortical capillary endothelial (ACE) cells (from primary culture, maximum of 12-14
passages) were plated in 96-well plates at 500 cells/well per 100 microliter. Assay media included low glucose
DMEM, 10% calf serum, 2 mM glutamine, and 1X penicillin/streptomycin/fungizone. Control wells included
the following: (1) no ACE cells added; (2) ACE cells alone; (3) ACE cells plus VEGF (5 ng/ml); and (4) ACE
cells plus FGF (5 ng/ml). The control or test sample (in 100 microliter volumes) was then added to the wells
(at dilutions of 1%, 0.1% and 0.01%, respectively). The cell cultures were incubated for 6-7 days at 37°C/5%
CO2. After the incubation, the media in the wells was aspirated, and the cells were washed 1X with PBS. An
acid phosphatase reaction mixture (100 microliter; 0.1 M sodium acetate, pH 5.5, 0.1% Triton X-100, 10 mM
p-nitrophenyl phosphate) was then added to each well. After a 2 hour incubation at 37°C, the reaction was
stopped by addition of 10 microliters 1N NaOH. Optical density (OD) was measured on a microplate reader
at 405 nm.

The activity of a PRO polypeptide was calculated as the fold increase in proliferation (as determined
by the acid phosphatase activity, OD 405 nm) relative to (1) cell only background, and (2) relative to maximum
stimulation by VEGF. VEGF (at 3-10 ng/ml) and FGF (at 1-5 ng/ml) were employed as an activity reference
for maximum stimulation. Results of the assay were considered "positive" if the observed stimulation was ≥
50% increase over background. VEGF (5 ng/ml) control at 1% dilution gave 1.24 fold stimulation; FGF (5
ng/ml) control at 1% dilution gave 1.46 fold stimulation.
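
As an illustrative sketch only, the fold-increase calculation and the positivity criterion above can be written as follows. The exact normalization used in the assay is not spelled out in the text, so the formulas here are assumptions: fold stimulation is taken as the sample OD divided by the cells-only OD, and the comparison to VEGF is taken as the fraction of the VEGF-induced OD increase. All function names are hypothetical.

```python
# Illustrative sketch: fold stimulation from OD 405 nm readings.
# Both formulas are assumed interpretations of the description above.

def fold_over_background(od_sample, od_cells_only):
    """Fold increase of a sample well over the cells-only control."""
    if od_cells_only <= 0:
        raise ValueError("cells-only OD must be positive")
    return od_sample / od_cells_only

def fraction_of_vegf_max(od_sample, od_cells_only, od_vegf):
    """Sample response as a fraction of the VEGF-induced increase."""
    window = od_vegf - od_cells_only
    if window <= 0:
        raise ValueError("VEGF control must exceed the cells-only control")
    return (od_sample - od_cells_only) / window

def is_positive(fold):
    """Positive when stimulation is at least a 50% increase over background."""
    return fold >= 1.5

if __name__ == "__main__":
    # Hypothetical OD readings, for demonstration only.
    fold = fold_over_background(od_sample=0.90, od_cells_only=0.50)
    print(f"fold stimulation: {fold:.2f}, positive: {is_positive(fold)}")
```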

The following PRO polypeptides tested positive in this assay: PRO1154 and PRO1186.

EXAMPLE 150: Inhibition of Vascular Endothelial Growth Factor (VEGF) Stimulated Proliferation of
Endothelial Cell Growth (Assay 9)

The ability of various PRO polypeptides to inhibit VEGF stimulated proliferation of endothelial cells
was tested. Polypeptides testing positive in this assay are useful for inhibiting endothelial cell growth in
mammals where such an effect would be beneficial, e.g., for inhibiting tumor growth.

Specifically, bovine adrenal cortical capillary endothelial cells (ACE) (from primary culture, maximum
of 12-14 passages) were plated in 96-well plates at 500 cells/well per 100 microliter. Assay media included
low glucose DMEM, 10% calf serum, 2 mM glutamine, and 1X penicillin/streptomycin/fungizone. Control
wells included the following: (1) no ACE cells added; (2) ACE cells alone; (3) ACE cells plus 5 ng/ml FGF;
(4) ACE cells plus 3 ng/ml VEGF; (5) ACE cells plus 3 ng/ml VEGF plus 1 ng/ml TGF-beta; and (6) ACE
cells plus 3 ng/ml VEGF plus 5 ng/ml LIF. The test samples, poly-his tagged PRO polypeptides (in 100
microliter volumes), were then added to the wells (at dilutions of 1%, 0.1% and 0.01%, respectively). The
cell cultures were incubated for 6-7 days at 37°C/5% CO2. After the incubation, the media in the wells was
aspirated, and the cells were washed 1X with PBS. An acid phosphatase reaction mixture (100 microliter;
0.1 M sodium acetate, pH 5.5, 0.1% Triton X-100, 10 mM p-nitrophenyl phosphate) was then added to each
well. After a 2 hour incubation at 37°C, the reaction was stopped by addition of 10 microliters 1N NaOH.
Optical density (OD) was measured on a microplate reader at 405 nm.

The activity of PRO polypeptides was calculated as the percent inhibition of VEGF (3 ng/ml)
stimulated proliferation (as determined by measuring acid phosphatase activity at OD 405 nm) relative to the
cells without stimulation. TGF-beta was employed as an activity reference at 1 ng/ml, since TGF-beta blocks
70-90% of VEGF-stimulated ACE cell proliferation. The results are indicative of the utility of the PRO
polypeptides in cancer therapy and specifically in inhibiting tumor angiogenesis. Numerical values (relative
inhibition) are determined by calculating the percent inhibition of VEGF stimulated proliferation by the PRO
polypeptides relative to cells without stimulation and then dividing that percentage into the percent inhibition
obtained by TGF-beta at 1 ng/ml which is known to block 70-90% of VEGF stimulated cell proliferation. The
results are considered positive if the PRO polypeptide exhibits 30% or greater inhibition of VEGF stimulation
of endothelial cell growth (relative inhibition 30% or greater).
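
The relative inhibition calculation can be illustrated by the following sketch, offered under stated assumptions rather than as the exact computation used. It assumes that percent inhibition is the fraction of the VEGF-induced OD increase (over unstimulated cells) that is lost in the presence of the test sample, and that relative inhibition is the PRO polypeptide's percent inhibition expressed as a percentage of the inhibition obtained with TGF-beta at 1 ng/ml. Function names and OD values are hypothetical.

```python
# Illustrative sketch: percent and relative inhibition of VEGF-stimulated
# proliferation from OD 405 nm readings. The formulas are an assumed
# reading of the description above.

def percent_inhibition(od_sample, od_vegf, od_unstimulated):
    """Percent of the VEGF-induced OD increase blocked by the sample."""
    window = od_vegf - od_unstimulated
    if window <= 0:
        raise ValueError("VEGF control must exceed unstimulated cells")
    return 100.0 * (od_vegf - od_sample) / window

def relative_inhibition(sample_pct, tgf_beta_pct):
    """Sample inhibition as a percentage of the TGF-beta reference."""
    if tgf_beta_pct <= 0:
        raise ValueError("TGF-beta reference inhibition must be positive")
    return 100.0 * sample_pct / tgf_beta_pct

def is_positive(relative_pct):
    """Positive when relative inhibition is 30% or greater."""
    return relative_pct >= 30.0

if __name__ == "__main__":
    # Hypothetical OD readings, for demonstration only.
    pro = percent_inhibition(od_sample=0.60, od_vegf=0.80, od_unstimulated=0.40)
    tgf = percent_inhibition(od_sample=0.48, od_vegf=0.80, od_unstimulated=0.40)
    rel = relative_inhibition(pro, tgf)
    print(f"PRO: {pro:.1f}%  TGF-beta: {tgf:.1f}%  relative: {rel:.1f}%")
```

Normalizing to the TGF-beta reference compensates for plate-to-plate variation in how strongly the cells respond to VEGF.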

The following polypeptide tested positive in this assay: PRO812.

EXAMPLE 151: Stimulatory Activity in Mixed Lymphocyte Reaction (MLR) Assay (Assay 24)

This example shows that certain polypeptides of the invention are active as a stimulator of the
proliferation of stimulated T-lymphocytes. Compounds which stimulate proliferation of lymphocytes are useful
therapeutically where enhancement of an immune response is beneficial. A therapeutic agent may take the
form of antagonists of the polypeptide of the invention, for example, murine-human chimeric, humanized or
human antibodies against the polypeptide.

The basic protocol for this assay is described in Current Protocols in Immunology, unit 3.12; edited
by J E Coligan, A M Kruisbeek, D H Margulies, E M Shevach, W Strober, National Institutes of Health,
Published by John Wiley & Sons, Inc.

More specifically, in one assay variant, peripheral blood mononuclear cells (PBMC) are isolated from
mammalian individuals, for example a human volunteer, by leukopheresis (one donor will supply stimulator
PBMCs, the other donor will supply responder PBMCs). If desired, the cells are frozen in fetal bovine serum
and DMSO after isolation. Frozen cells may be thawed overnight in assay media (37°C, 5% CO2) and then
washed and resuspended to 3x10^6 cells/ml of assay media (RPMI; 10% fetal bovine serum, 1%
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate). The
stimulator PBMCs are prepared by irradiating the cells (about 3000 Rads).

The assay is prepared by plating in triplicate wells a mixture of:
100 µl of test sample diluted to 1% or to 0.1%,
50 µl of irradiated stimulator cells, and
50 µl of responder PBMC cells.
100 microliters of cell culture media or 100 microliters of CD4-IgG is used as the control. The wells are then
incubated at 37°C, 5% CO2 for 4 days. On day 5, each well is pulsed with tritiated thymidine (1.0 mC/well;
Amersham). After 6 hours the cells are washed 3 times and then the uptake of the label is evaluated.

In another variant of this assay, PBMCs are isolated from the spleens of Balb/c mice and C57B6 mice.
The cells are teased from freshly harvested spleens in assay media (RPMI; 10% fetal bovine serum, 1%
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate) and the
PBMCs are isolated by overlaying these cells over Lympholyte M (Organon Teknika), centrifuging at 2000
rpm for 20 minutes, collecting and washing the mononuclear cell layer in assay media and resuspending the
cells to 1x10^ cells/ml of assay media. The assay is then conducted as described above.

Increases over control are considered positive, with increases of greater than or equal to 180%
being preferred. However, any value greater than control indicates a stimulatory effect for the test protein.
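
As a rough sketch of how this scoring could be applied to triplicate thymidine-uptake counts, the snippet below expresses the mean sample count as a percentage of the mean control count. The function names and labels are hypothetical, and reading "180%" as a percentage of the control value is an assumption; the text does not define the denominator explicitly.

```python
from statistics import mean


def percent_of_control(sample_counts, control_counts):
    """Mean sample uptake as a percentage of mean control uptake."""
    control_mean = mean(control_counts)
    if control_mean <= 0:
        raise ValueError("control uptake must be positive")
    return 100.0 * mean(sample_counts) / control_mean


def classify_stimulation(pct_of_control, preferred=180.0):
    """Any value above control is stimulatory; >= 180% is preferred."""
    if pct_of_control >= preferred:
        return "preferred positive"
    if pct_of_control > 100.0:
        return "positive"
    return "negative"


if __name__ == "__main__":
    # Illustrative triplicate counts only, not assay data.
    pct = percent_of_control([3000, 3600, 4200], [2000, 2000, 2000])
    print(pct, classify_stimulation(pct))
```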

The following PRO polypeptides tested positive in this assay: PRO826, PRO1068, PRO1184,
PRO1346 and PRO1375.

EXAMPLE 152: Retinal Neuron Survival (Assay 52)
This example demonstrates that certain PRO polypeptides have efficacy in enhancing the survival of
retinal neuron cells and, therefore, are useful for the therapeutic treatment of retinal disorders or injuries
including, for example, treating sight loss in mammals due to retinitis pigmentosa, AMD, etc.

Sprague Dawley rat pups at postnatal day 7 (mixed population: glia and retinal neuronal types) are
killed by decapitation following CO2 anesthesia and the eyes are removed under sterile conditions. The neural
retina is dissected away from the pigment epithelium and other ocular tissue and then dissociated into a single
cell suspension using 0.25% trypsin in Ca2+, Mg2+-free PBS. The retinas are incubated at 37°C for 7-10
minutes after which the trypsin is inactivated by adding 1 ml soybean trypsin inhibitor. The cells are plated
at 100,000 cells per well in 96 well plates in DMEM/F12 supplemented with N2 and with or without the
specific test PRO polypeptide. Cells for all experiments are grown at 37°C in a water saturated atmosphere
of 5% CO2. After 2-3 days in culture, cells are stained with calcein AM then fixed using 4%
paraformaldehyde and stained with DAPI for determination of total cell count. The total cells (fluorescent)
are quantified at 20X objective magnification using a CCD camera and NIH Image software for Macintosh.
Fields in the well are chosen at random.

The effects of various concentrations of PRO polypeptides are reported herein, where percent survival
is calculated by dividing the total number of calcein AM positive cells at 2-3 days in culture by the total
number of DAPI-labeled cells at 2-3 days in culture. Anything above 30% survival is considered positive.
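
The survival readout can be illustrated with a short sketch. The helper names are hypothetical, and pooling the counts from several randomly chosen fields before dividing is an assumption made here for illustration; the text only states the ratio and the 30% cutoff.

```python
def percent_survival(calcein_positive, dapi_total):
    """Calcein AM positive cells as a percentage of DAPI-labeled cells."""
    if dapi_total <= 0:
        raise ValueError("total DAPI-labeled cell count must be positive")
    return 100.0 * calcein_positive / dapi_total


def summarize_fields(fields, cutoff=30.0):
    """Pool (calcein_positive, dapi_total) counts from imaged fields.

    Returns the pooled percent survival and whether it is above the cutoff.
    """
    calcein = sum(c for c, _ in fields)
    dapi = sum(d for _, d in fields)
    pct = percent_survival(calcein, dapi)
    return pct, pct > cutoff


if __name__ == "__main__":
    # Two illustrative fields; the counts are invented for the example.
    print(summarize_fields([(40, 100), (35, 100)]))
```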

The following PRO polypeptides tested positive in this assay using polypeptide concentrations within
the range of 0.01% to 1.0% in the assay: PRO828, PRO826, PRO1068 and PRO1132.

EXAMPLE 153: Rod Photoreceptor Cell Survival (Assay 56)

This assay shows that certain polypeptides of the invention act to enhance the survival/proliferation
of rod photoreceptor cells and, therefore, are useful for the therapeutic treatment of retinal disorders or injuries
including, for example, treating sight loss in mammals due to retinitis pigmentosa, AMD, etc.

Sprague Dawley rat pups at 7 days postnatal (mixed population: glia and retinal neuronal cell types)
are killed by decapitation following CO2 anesthesia and the eyes are removed under sterile conditions. The
neural retina is dissected away from the pigment epithelium and other ocular tissue and then dissociated into
a single cell suspension using 0.25% trypsin in Ca2+, Mg2+-free PBS. The retinas are incubated at 37°C for
7-10 minutes after which the trypsin is inactivated by adding 1 ml soybean trypsin inhibitor. The cells are
plated at 100,000 cells per well in 96 well plates in DMEM/F12 supplemented with N2. Cells for all
experiments are grown at 37°C in a water saturated atmosphere of 5% CO2. After 2-3 days in culture, cells
are fixed using 4% paraformaldehyde, and then stained using CellTracker Green CMFDA. Rho 4D2 (ascites
or IgG 1:100), a monoclonal antibody directed towards the visual pigment rhodopsin, is used to detect rod
photoreceptor cells by indirect immunofluorescence. The results are calculated as % survival: total number
of calcein - rhodopsin positive cells at 2-3 days in culture, divided by the total number of rhodopsin positive
cells at time 2-3 days in culture. The total cells (fluorescent) are quantified at 20x objective magnification
using a CCD camera and NIH Image software for Macintosh. Fields in the well are chosen at random.

The following polypeptides tested positive in this assay: PRO536, PRO943, PRO828, PRO826,
PRO1068 and PRO1132.

EXAMPLE 154: Induction of c-fos in Endothelial Cells (Assay 34)

This assay is designed to determine whether PRO polypeptides show the ability to induce c-fos in
endothelial cells. PRO polypeptides testing positive in this assay would be expected to be useful for the
therapeutic treatment of conditions or disorders where angiogenesis would be beneficial including, for example,
wound healing, and the like (as would agonists of these PRO polypeptides). Antagonists of the PRO
polypeptides testing positive in this assay would be expected to be useful for the therapeutic treatment of
cancerous tumors.

Human venous umbilical vein endothelial cells (HUVEC, Cell Systems) in growth media (50% Ham's
F12 w/o GHT: low glucose, and 50% DMEM without glycine: with NaHCO3, 1% glutamine, 10 mM
HEPES, 10% FBS, 10 ng/ml bFGF) were plated on 96-well microtiter plates at a cell density of 1x10*
cells/well. The day after plating, the cells were starved by removing the growth media and treating the cells
with 100 µl/well test samples and controls (positive control = growth media; negative control = Protein 32
buffer = 10 mM HEPES, 140 mM NaCl, 4% (w/v) mannitol, pH 6.8). The cells were incubated for 30
minutes at 37°C, in 5% CO2. The samples were removed, and the first part of the bDNA kit protocol (Chiron
Diagnostics, cat. #6005-037) was followed, where each capitalized reagent/buffer listed below was available
from the kit.

Briefly, the amounts of the TM Lysis Buffer and Probes needed for the tests were calculated based
on information provided by the manufacturer. The appropriate amounts of thawed Probes were added to the
TM Lysis Buffer. The Capture Hybridization Buffer was warmed to room temperature. The bDNA strips
were set up in the metal strip holders, and 100 µl of Capture Hybridization Buffer was added to each bDNA
well needed, followed by incubation for at least 30 minutes. The test plates with the cells were removed from
the incubator, and the media was gently removed using the vacuum manifold. 100 µl of Lysis Hybridization
Buffer with Probes was quickly pipetted into each well of the microtiter plates. The plates were then
incubated at 55°C for 15 minutes. Upon removal from the incubator, the plates were placed on the vortex
mixer with the microtiter adapter head and vortexed on the #2 setting for one minute. 80 µl of the lysate was
removed and added to the bDNA wells containing the Capture Hybridization Buffer, and pipetted up and down
to mix. The plates were incubated at 53°C for at least 16 hours.

On the next day, the second part of the bDNA kit protocol was followed. Specifically, the plates
were removed from the incubator and placed on the bench to cool for 10 minutes. The volumes of additions
needed were calculated based upon information provided by the manufacturer. An Amplifier Working Solution
was prepared by making a 1:100 dilution of the Amplifier Concentrate (20 fm/µl) in AL Hybridization Buffer.
The hybridization mixture was removed from the plates and washed twice with Wash A. 50 µl of Amplifier
Working Solution was added to each well and the wells were incubated at 53°C for 30 minutes. The plates
were then removed from the incubator and allowed to cool for 10 minutes. The Label Probe Working Solution
was prepared by making a 1:100 dilution of Label Concentrate (40 pmoles/µl) in AL Hybridization Buffer.
After the 10-minute cool-down period, the amplifier hybridization mixture was removed and the plates were
washed twice with Wash A. 50 µl of Label Probe Working Solution was added to each well and the wells
were incubated at 53°C for 15 minutes. After cooling for 10 minutes, the Substrate was warmed to room
temperature. Upon addition of 3 µl of Substrate Enhancer to each ml of Substrate needed for the assay, the
plates were allowed to cool for 10 minutes, the label hybridization mixture was removed, and the plates were
washed twice with Wash A and three times with Wash D. 50 µl of the Substrate Solution with Enhancer was
added to each well. The plates were incubated for 30 minutes at 37°C and RLU was read in an appropriate
luminometer.

The replicates were averaged and the coefficient of variation was determined. Activity was measured
as the fold increase over the negative control (Protein 32/HEPES buffer described above) value, as indicated
by chemiluminescence units (RLU). The results are considered positive if the PRO polypeptide exhibits at least
a two-fold value over the negative buffer control. Negative control = 1.00 RLU at 1.00% dilution. Positive
control = 8.39 RLU at 1.00% dilution.
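
The replicate handling described above can be sketched as follows. The function names are hypothetical, and the use of the sample standard deviation for the coefficient of variation is an assumption, since the text does not say which estimator was used.

```python
from statistics import mean, stdev


def coefficient_of_variation(replicates):
    """CV in percent: sample standard deviation divided by the mean."""
    return 100.0 * stdev(replicates) / mean(replicates)


def fold_over_negative(sample_rlu, negative_rlu):
    """Mean sample RLU divided by mean negative (buffer) control RLU."""
    negative_mean = mean(negative_rlu)
    if negative_mean <= 0:
        raise ValueError("negative control RLU must be positive")
    return mean(sample_rlu) / negative_mean


def is_positive(fold, threshold=2.0):
    """Positive when at least two-fold over the negative buffer control."""
    return fold >= threshold


if __name__ == "__main__":
    # Illustrative replicate readings only.
    fold = fold_over_negative([8.0, 9.0, 8.5], [1.0, 1.0, 1.0])
    print(fold, is_positive(fold), coefficient_of_variation([9.0, 10.0, 11.0]))
```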

The following PRO polypeptides tested positive in this assay: PRO535, PRO826, PRO819, PRO1126,
PRO1160 and PRO1387.



EXAMPLE 155: Inhibitory Activity in Mixed Lymphocyte Reaction (MLR) Assay (Assay 67)

This example shows that one or more of the polypeptides of the invention are active as inhibitors of 
the proliferation of stimulated T-lymphocytes. Compounds which inhibit proliferation of lymphocytes are 
useful therapeutically where suppression of an immune response is beneficial. 

The basic protocol for this assay is described in Current Protocols in Immunology, unit 3.12; edited
by J E Coligan, A M Kruisbeek, D H Margulies, E M Shevach, W Strober, National Institutes of Health,
Published by John Wiley & Sons, Inc.

More specifically, in one assay variant, peripheral blood mononuclear cells (PBMC) are isolated from
mammalian individuals, for example a human volunteer, by leukopheresis (one donor will supply stimulator
PBMCs, the other donor will supply responder PBMCs). If desired, the cells are frozen in fetal bovine serum
and DMSO after isolation. Frozen cells may be thawed overnight in assay media (37°C, 5% CO2) and then
washed and resuspended to 3x10* cells/ml of assay media (RPMI; 10% fetal bovine serum, 1%
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate). The
stimulator PBMCs are prepared by irradiating the cells (about 3000 Rads).

The assay is prepared by plating in triplicate wells a mixture of:
100 µl of test sample diluted to 1% or to 0.1%,
50 µl of irradiated stimulator cells, and
50 µl of responder PBMC cells.
100 microliters of cell culture media or 100 microliters of CD4-IgG is used as the control. The wells are then
incubated at 37°C, 5% CO2 for 4 days. On day 5, each well is pulsed with tritiated thymidine (1.0 mC/well;
Amersham). After 6 hours the cells are washed 3 times and then the uptake of the label is evaluated.

In another variant of this assay, PBMCs are isolated from the spleens of Balb/c mice and C57B6 mice.
The cells are teased from freshly harvested spleens in assay media (RPMI; 10% fetal bovine serum, 1%
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate) and the
PBMCs are isolated by overlaying these cells over Lympholyte M (Organon Teknika), centrifuging at 2000
rpm for 20 minutes, collecting and washing the mononuclear cell layer in assay media and resuspending the
cells to 1x10' cells/ml of assay media. The assay is then conducted as described above.

Any decrease below control is considered to be a positive result for an inhibitory compound, with
decreases of less than or equal to 80% being preferred. However, any value less than control indicates an
inhibitory effect for the test protein.

The following polypeptides tested positive in this assay: PRO1114, PRO836, PRO1159, PRO1312,
PRO1192, PRO1195 and PRO1387.

EXAMPLE 156: Mouse Kidney Mesangial Cell Proliferation Assay (Assay 92)

This assay shows that certain polypeptides of the invention act to induce proliferation of mammalian
kidney mesangial cells and, therefore, are useful for treating kidney disorders associated with decreased
mesangial cell function such as Berger disease or other nephropathies associated with Schönlein-Henoch
purpura, celiac disease, dermatitis herpetiformis or Crohn disease. The assay is performed as follows. On day 1,
mouse kidney mesangial cells are plated on a 96 well plate in growth media (3:1 mixture of Dulbecco's
modified Eagle's medium and Ham's F12 medium, 95%; fetal bovine serum, 5%; supplemented with 14 mM
HEPES) and grown overnight. On day 2, PRO polypeptides are diluted at 2 concentrations (1% and 0.1%)
in serum-free medium and added to the cells. Control samples are serum-free medium alone. On day 4, 20 µl
of the Cell Titer 96 Aqueous one solution reagent (Promega) is added to each well and the colorimetric
reaction is allowed to proceed for 2 hours. The absorbance (OD) is then measured at 490 nm. A positive
in the assay is anything that gives an absorbance reading which is at least 15% above the control reading.

The following polypeptides tested positive in this assay: PRO819, PRO813 and PRO1066.

EXAMPLE 157: Pericyte c-Fos Induction (Assay 93)

This assay shows that certain polypeptides of the invention act to induce the expression of c-fos in
pericyte cells and, therefore, are useful not only as diagnostic markers for particular types of
pericyte-associated tumors but also for giving rise to antagonists which would be expected to be useful for the
therapeutic treatment of pericyte-associated tumors. Specifically, on day 1, pericytes are received from VEC
Technologies and all but 5 ml of media is removed from the flask. On day 2, the pericytes are trypsinized,
washed, spun and then plated onto 96 well plates. On day 7, the media is removed and the pericytes are
treated with 100 µl of PRO polypeptide test samples and controls (positive control = DME + 5% serum +/-
PDGF at 500 ng/ml; negative control = Protein 32). Replicates are averaged and SD/CV are determined.
Fold increase over Protein 32 (buffer control) value, indicated by chemiluminescence units (RLU) luminometer
reading versus frequency, is plotted on a histogram. Two-fold above Protein 32 value is considered positive for
the assay. ASY Matrix: Growth media = low glucose DMEM + 20% FBS + 1X pen strep + 1X fungizone.
Assay Media = low glucose DMEM + 5% FBS.

The following polypeptides tested positive in this assay: PRO943 and PRO819.

EXAMPLE 158: Detection of PRO Polypeptides That Affect Glucose or FFA Uptake by Primary Rat
Adipocytes (Assay 94)

This assay is designed to determine whether PRO polypeptides show the ability to affect glucose or
FFA uptake by adipocyte cells. PRO polypeptides testing positive in this assay would be expected to be useful
for the therapeutic treatment of disorders where either the stimulation or inhibition of glucose uptake by
adipocytes would be beneficial including, for example, obesity, diabetes or hyper- or hypo-insulinemia.

In a 96 well format, PRO polypeptides to be assayed are added to primary rat adipocytes, and allowed
to incubate overnight. Samples are taken at 4 and 16 hours and assayed for glycerol, glucose and FFA uptake.
After the 16 hour incubation, insulin is added to the media and allowed to incubate for 4 hours. At this time,
a sample is taken and glycerol, glucose and FFA uptake is measured. Media containing insulin without the
PRO polypeptide is used as a positive reference control. As the PRO polypeptide being tested may either
stimulate or inhibit glucose and FFA uptake, results are scored as positive in the assay if greater than 1.5 times
or less than 0.5 times the insulin control.
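
A minimal sketch of this two-sided scoring rule is shown below. The function name and the returned labels are hypothetical; only the 1.5-fold and 0.5-fold limits relative to the insulin control come from the text.

```python
def score_uptake(sample_uptake, insulin_control_uptake, upper=1.5, lower=0.5):
    """Score a sample against the insulin-only reference control.

    Returns "stimulator" above 1.5x, "inhibitor" below 0.5x, else "negative".
    """
    if insulin_control_uptake <= 0:
        raise ValueError("insulin control uptake must be positive")
    ratio = sample_uptake / insulin_control_uptake
    if ratio > upper:
        return "stimulator"
    if ratio < lower:
        return "inhibitor"
    return "negative"


if __name__ == "__main__":
    # Illustrative uptake values in arbitrary units.
    for value in (3.2, 2.0, 0.8):
        print(value, score_uptake(value, 2.0))
```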

The following PRO polypeptides tested positive as stimulators of glucose and/or FFA uptake in this
assay: PRO1114, PRO1007, PRO1066, PRO848, PRO1182, PRO1198, PRO1192, PRO1271, PRO1375 and
PRO1387.

The following PRO polypeptides tested positive as inhibitors of glucose and/or FFA uptake in this
assay: PRO1184, PRO1360, PRO1309, PRO1154, PRO1181, PRO1186, PRO1160 and PRO1384.

EXAMPLE 159: Chondrocyte Re-differentiation Assay (Assay 110)

This assay shows that certain polypeptides of the invention act to induce redifferentiation of
chondrocytes and, therefore, are expected to be useful for the treatment of various bone and/or cartilage disorders
such as, for example, sports injuries and arthritis. The assay is performed as follows. Porcine chondrocytes
are isolated by overnight collagenase digestion of articular cartilage of metacarpophalangeal joints of 4-6
month old female pigs. The isolated cells are then seeded at 25,000 cells/cm2 in Ham F-12 containing 10%
FBS and 4 µg/ml gentamycin. The culture media is changed every third day and the cells are then seeded in
96 well plates at 5,000 cells/well in 100 µl of the same media without serum and 100 µl of the test PRO
polypeptide, 5 nM staurosporin (positive control) or medium alone (negative control) is added to give a final
volume of 200 µl/well. After 5 days of incubation at 37°C a picture of each well is taken and the
differentiation state of the chondrocytes is determined. A positive result in the assay occurs when the
redifferentiation of the chondrocytes is determined to be more similar to the positive control than the negative
control.

The following polypeptides tested positive in this assay: PRO1282, PRO1310, PRO619, PRO943,
PRO820, PRO1080, PRO1016, PRO1007, PRO1056, PRO791, PRO1111, PRO1184, PRO1360, PRO1309,
PRO1107, PRO1132, PRO1131, PRO848, PRO1181, PRO1186, PRO1159, PRO1312, PRO1192 and
PRO1384.



WO 00/73454    PCT/US00/08439

EXAMPLE 160: Chondrocyte Proliferation Assay (Assay 111)

This assay is designed to determine whether PRO polypeptides of the present invention show the
ability to induce the proliferation and/or redifferentiation of chondrocytes in culture. PRO polypeptides testing
positive in this assay would be expected to be useful for the therapeutic treatment of various bone and/or
cartilage disorders such as, for example, sports injuries and arthritis.

Porcine chondrocytes are isolated by overnight collagenase digestion of articular cartilage of the
metacarpophalangeal joint of 4-6 month old female pigs. The isolated cells are then seeded at 25,000 cells/cm2
in Ham F-12 containing 10% FBS and 4 µg/ml gentamycin. The culture media is changed every third day and
the cells are reseeded to 25,000 cells/cm2 every five days. On day 12, the cells are seeded in 96 well plates
at 5,000 cells/well in 100 µl of the same media without serum and 100 µl of either serum-free medium (negative
control), staurosporin (final concentration of 5 nM; positive control) or the test PRO polypeptide is added
to give a final volume of 200 µl/well. After 5 days at 37°C, 20 µl of Alamar blue is added to each well and
the plates are incubated for an additional 3 hours at 37°C. The fluorescence is then measured in each well
(Ex: 530 nm; Em: 590 nm). The fluorescence of a plate containing 200 µl of the serum-free medium is
measured to obtain the background. A positive result in the assay is obtained when the fluorescence of the
PRO polypeptide treated sample is more like that of the positive control than the negative control.
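
One way to make the "more like the positive control" criterion concrete is sketched below. This is an assumption-laden illustration: the function names are hypothetical, and interpreting "more like" as a smaller absolute difference in background-corrected fluorescence is a choice made here, not something the text specifies.

```python
def background_corrected(raw_fluorescence, background):
    """Subtract the medium-only background reading from a well reading."""
    return raw_fluorescence - background


def closer_to_positive(sample, positive_control, negative_control):
    """True when the sample lies nearer the positive than the negative control."""
    return abs(sample - positive_control) < abs(sample - negative_control)


if __name__ == "__main__":
    # Illustrative readings in arbitrary fluorescence units.
    background = 50.0
    sample = background_corrected(750.0, background)
    positive = background_corrected(950.0, background)
    negative = background_corrected(350.0, background)
    print(closer_to_positive(sample, positive, negative))
```

Because the same background is subtracted from every well, the correction does not change which control the sample is nearer to; it matters only when corrected values are reported.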

The following PRO polypeptides tested positive in this assay: PRO1310, PRO844, PRO1312,
PRO1192 and PRO1387.

EXAMPLE 161: Induction of Pancreatic β-Cell Precursor Proliferation (Assay 117)

This assay shows that certain polypeptides of the invention act to induce an increase in the number
of pancreatic β-cell precursor cells and, therefore, are useful for treating various insulin deficient states in
mammals, including diabetes mellitus. The assay is performed as follows. The assay uses a primary culture
of mouse fetal pancreatic cells and the primary readout is an alteration in the expression of markers that
represent either β-cell precursors or mature β-cells. Marker expression is measured by real time quantitative
PCR (RTQ-PCR), wherein the marker being evaluated is a transcription factor called Pdx1.

The pancreata are dissected from E14 embryos (CD1 mice). The pancreata are then digested with
collagenase/dispase in F12/DMEM at 37°C for 40 to 60 minutes (collagenase/dispase, 1.37 mg/ml, Boehringer
Mannheim, #1097113). The digestion is then neutralized with an equal volume of 5% BSA and the cells are
washed once with RPMI1640. At day 1, the cells are seeded into 12-well tissue culture plates (pre-coated with
laminin, 20 µg/ml in PBS, Boehringer Mannheim, #124317). Cells from pancreata from 1-2 embryos are
distributed per well. The culture medium for this primary culture is 14F/1640. At day 2, the media is removed
and the attached cells washed with RPMI/1640. Two ml of minimal media are added in addition to the protein
to be tested. At day 4, the media is removed and RNA prepared from the cells and marker expression analyzed
by real time quantitative RT-PCR. A protein is considered to be active in the assay if it increases the expression
of the relevant β-cell marker as compared to untreated controls.
14F/1640 is RPMI1640 (Gibco) plus the following:
group A    1:1000
group B    1:1000
recombinant human insulin    10 µg/ml
Aprotinin (50 µg/ml)    1:2000 (Boehringer Mannheim #981532)
Bovine pituitary extract (BPE)    60 µg/ml
Gentamycin    100 ng/ml

Group A: (in 10 ml PBS)
Transferrin, 100 mg (Sigma T2252)
Epidermal Growth Factor, 100 µg (BRL 100004)
Triiodothyronine, 10 µl of 5x10* M (Sigma T5516)
Ethanolamine, 100 µl of 10"' M (Sigma E0135)
Phosphoethanolamine, 100 µl of 10 ' M (Sigma P0503)
Selenium, 4 µl of 10"' M (Aesar #12574)

Group C: (in 10 ml 100% ethanol)
Hydrocortisone, 2 µl of 5X10-^ M (Sigma #H0135)
Progesterone, 100 µl of 1X10"^ M (Sigma #P6149)
Forskolin, 500 µl of 20 mM (Calbiochem #344270)

Minimal media:
RPMI 1640 plus transferrin (10 µg/ml), insulin (1 µg/ml), gentamycin (100 ng/ml), aprotinin (50
µg/ml) and BPE (15 µg/ml).

Defined media:
RPMI 1640 plus transferrin (10 µg/ml), insulin (1 µg/ml), gentamycin (100 ng/ml) and aprotinin (50
µg/ml).

The following polypeptides tested positive in this assay: PRO1310, PRO1188, PRO1131 and
PRO1387.

EXAMPLE 162: Detection of Polypeptides That Affect Glucose or FFA Uptake in Skeletal Muscle (Assay
106)

This assay is designed to determine whether PRO polypeptides show the ability to affect glucose or
FFA uptake by skeletal muscle cells. PRO polypeptides testing positive in this assay would be expected to be
useful for the therapeutic treatment of disorders where either the stimulation or inhibition of glucose uptake
by skeletal muscle would be beneficial including, for example, diabetes or hyper- or hypo-insulinemia.

In a 96 well format, PRO polypeptides to be assayed are added to primary rat differentiated skeletal
muscle, and allowed to incubate overnight. Then fresh media with the PRO polypeptide and +/- insulin are
added to the wells. The sample media is then monitored to determine glucose and FFA uptake by the skeletal
muscle cells. The insulin will stimulate glucose and FFA uptake by the skeletal muscle, and insulin in media
without the PRO polypeptide is used as a positive control and as a limit for scoring. As the PRO polypeptide
being tested may either stimulate or inhibit glucose and FFA uptake, results are scored as positive in the assay
if greater than 1.5 times or less than 0.5 times the insulin control.

The following PRO polypeptides tested positive as either stimulators or inhibitors of glucose and/or
FFA uptake in this assay: PRO358, PRO1016, PRO1007, PRO826, PRO1066, PRO1029 and PRO1309.

EXAMPLE 163: Fetal Hemoglobin Induction in an Erythroblastic Cell Line (Assay 107)

This assay is useful for screening PRO polypeptides for the ability to induce the switch from adult
hemoglobin to fetal hemoglobin in an erythroblastic cell line. Molecules testing positive in this assay are
expected to be useful for therapeutically treating various mammalian hemoglobin-associated disorders such as
the various thalassemias. The assay is performed as follows. Erythroblastic cells are plated in standard growth
medium at 1000 cells/well in a 96 well format. PRO polypeptides are added to the growth medium at a
concentration of 0.2% or 2% and the cells are incubated for 5 days at 37°C. As a positive control, cells are
treated with 100 µM hemin and as a negative control, the cells are untreated. After 5 days, cell lysates are
prepared and analyzed for the expression of gamma globin (a fetal marker). A positive in the assay is a gamma
globin level at least 2-fold above the negative control.

The following polypeptides tested positive in this assay: PRO1114, PRO826, PRO1066, PRO844,
PRO1192 and PRO1358.

EXAMPLE 164: Induction of Pancreatic β-Cell Precursor Differentiation (Assay 89)

This assay shows that certain polypeptides of the invention act to induce differentiation of pancreatic
β-cell precursor cells into mature pancreatic β-cells and, therefore, are useful for treating various insulin
deficient states in mammals, including diabetes mellitus. The assay is performed as follows. The assay uses
a primary culture of mouse fetal pancreatic cells and the primary readout is an alteration in the expression of
markers that represent either β-cell precursors or mature β-cells. Marker expression is measured by real time
quantitative PCR (RTQ-PCR), wherein the marker being evaluated is insulin.

The pancreata are dissected from E14 embryos (CD1 mice). The pancreata are then digested with
collagenase/dispase in F12/DMEM at 37°C for 40 to 60 minutes (collagenase/dispase, 1.37 mg/ml, Boehringer
Mannheim, #1097113). The digestion is then neutralized with an equal volume of 5% BSA and the cells are
washed once with RPMI1640. At day 1, the cells are seeded into 12-well tissue culture plates (pre-coated with
laminin, 20 µg/ml in PBS, Boehringer Mannheim, #124317). Cells from pancreata from 1-2 embryos are
distributed per well. The culture medium for this primary culture is 14F/1640. At day 2, the media is removed
and the attached cells washed with RPMI/1640. Two ml of minimal media are added in addition to the protein
to be tested. At day 4, the media is removed and RNA prepared from the cells and marker expression analyzed
by real time quantitative RT-PCR. A protein is considered to be active in the assay if it increases the expression
of the relevant β-cell marker as compared to untreated controls.
14F/1640 is RPMI1640 (Gibco) plus the following:
group A    1:1000
group B    1:1000
recombinant human insulin    10 µg/ml
Aprotinin (50 µg/ml)    1:2000 (Boehringer Mannheim #981532)
Bovine pituitary extract (BPE)    60 µg/ml
Gentamycin    100 ng/ml

Group A: (in 10 ml PBS)
Transferrin, 100 mg (Sigma T2252)
Epidermal Growth Factor, 100 µg (BRL 100004)
Triiodothyronine, 10 µl of 5x10* M (Sigma T5516)
Ethanolamine, 100 µl of 10"' M (Sigma E0135)
Phosphoethanolamine, 100 µl of 10 ' M (Sigma P0503)
Selenium, 4 µl of 10'' M (Aesar #12574)

Group C: (in 10 ml 100% ethanol)
Hydrocortisone, 2 µl of 5X10*^ M (Sigma #H0135)
Progesterone, 100 µl of 1X10"' M (Sigma #P6149)
Forskolin, 500 µl of 20 mM (Calbiochem #344270)

Minimal media:
RPMI 1640 plus transferrin (10 µg/ml), insulin (1 µg/ml), gentamycin (100 ng/ml), aprotinin (50
µg/ml) and BPE (15 µg/ml).

Defined media:
RPMI 1640 plus transferrin (10 µg/ml), insulin (1 µg/ml), gentamycin (100 ng/ml) and aprotinin (50
µg/ml).
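
For the supplements specified as dilution ratios, the stock volume needed for a batch of medium can be worked out as in the sketch below. The function name, the 500 ml batch size, and the reading of "1:1000" as one part stock in 1000 parts final volume are all assumptions made for illustration.

```python
def stock_volume_ml(final_volume_ml, dilution_ratio):
    """Volume of stock to add for a 1:dilution_ratio final dilution.

    Assumes "1:N" means one part stock per N parts of final volume.
    """
    if dilution_ratio <= 0:
        raise ValueError("dilution ratio must be positive")
    return final_volume_ml / dilution_ratio


if __name__ == "__main__":
    batch_ml = 500  # hypothetical batch size, not taken from the text
    print("group stock at 1:1000:", stock_volume_ml(batch_ml, 1000), "ml")
    print("aprotinin stock at 1:2000:", stock_volume_ml(batch_ml, 2000), "ml")
```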

The following polypeptides were positive in this assay: PRO1188, PRO1132, PRO1131 and
PRO1181.

EXAMPLE 165: Skin Vascular Permeability Assay (Assay 64)

This assay shows that certain polypeptides of the invention stimulate an immune response and induce
inflammation by inducing mononuclear cell, eosinophil and PMN infiltration at the site of injection of the
animal. Compounds which stimulate an immune response are useful therapeutically where stimulation of an
immune response is beneficial. This skin vascular permeability assay is conducted as follows. Hairless guinea
pigs weighing 350 grams or more are anesthetized with ketamine (75-80 mg/kg) and 5 mg/kg xylazine
intramuscularly (IM). A sample of purified polypeptide of the invention or a conditioned media test sample
is injected intradermally onto the backs of the test animals with 100 µl per injection site. It is possible to have
about 10-30, preferably about 16-24, injection sites per animal. One µl of Evans blue dye (1% in physiologic
buffered saline) is injected intracardially. Blemishes at the injection sites are then measured (mm diameter)
at 1 hr and 6 hr post injection. Animals are sacrificed at 6 hrs after injection. Each skin injection site is
biopsied and fixed in formalin. The skins are then prepared for histopathologic evaluation. Each site is
evaluated for inflammatory cell infiltration into the skin. Sites with visible inflammatory cell infiltration are
scored as positive. Inflammatory cells may be neutrophilic, eosinophilic, monocytic or lymphocytic. At least
a minimal perivascular infiltrate at the injection site is scored as positive; no infiltrate at the site of injection is
scored as negative.

The following polypeptides tested positive in this assay: PRO1007, PRO1358 and PRO1375.

EXAMPLE 166: Induction of Endothelial Cell Apoptosis (ELISA) (Assay 109)

The ability of PRO polypeptides to induce apoptosis in endothelial cells was tested in human venous
umbilical vein endothelial cells (HUVEC, Cell Systems) using a 96-well format, in 0% serum media
supplemented with 100 ng/ml VEGF, 0.1% BSA, 1X penn/strep. A positive result in this assay indicates the
usefulness of the polypeptide for therapeutically treating any of a variety of conditions associated with
undesired endothelial cell growth including, for example, the inhibition of tumor growth. The 96-well plates
used were manufactured by Falcon (No. 3072). Coating of 96 well plates was prepared by allowing
gelatinization to occur for >30 minutes with 100 µl of 0.2% gelatin in PBS solution. The gelatin mix was
aspirated thoroughly before plating HUVEC cells at a final concentration of 2 x 10* cells/ml in 10% serum
containing medium, 100 µl volume per well. The cells were grown for 24 hours before adding test samples
containing the PRO polypeptide of interest.

To all wells, 100 µl of 0% serum media (Cell Systems) complemented with 100 ng/ml VEGF, 0.1%
BSA, 1X penn/strep was added. Test samples containing PRO polypeptides were added in triplicate at
dilutions of 1%, 0.33% and 0.11%. Wells without cells were used as a blank and wells with cells only were
used as a negative control. As a positive control, 1:3 serial dilutions of 50 µl of a 3x stock of staurosporine
were used. The cells were incubated for 24 to 35 hours prior to ELISA.

ELISA was used to determine levels of apoptosis, preparing solutions according to the Boehringer
Manual [Boehringer, Cell Death Detection ELISA plus, Cat No. 1 920 685]. Sample preparations: 96 well
plates were spun down at 1 krpm for 10 minutes (200g); the supernatant was removed by fast inversion,
placing the plate upside down on a paper towel to remove residual liquid. To each well, 200 µl of 1X Lysis
buffer was added and incubation allowed at room temperature for 30 minutes without shaking. The plates were
spun down for 10 minutes at 1 krpm, and 20 µl of the lysate (cytoplasmic fraction) was transferred into
streptavidin coated MTP. 80 µl of immunoreagent mix was added to the 20 µl lysate in each well. The MTP
was covered with adhesive foil and incubated at room temperature for 2 hours by placing it on an orbital
shaker (200 rpm). After two hours, the supernatant was removed by suction and the wells rinsed three times
with 250 µl of 1X incubation buffer per well (removed by suction). Substrate solution was added (100 µl) into
each well and incubated on an orbital shaker at room temperature at 250 rpm until color development was
sufficient for a photometric analysis (approx. after 10-20 minutes). A 96 well reader was used to read the
plates at 405 nm (reference wavelength 492 nm). The levels obtained for PIN 32 (control buffer) were set to
100%. Samples with levels > 130% were considered positive for induction of apoptosis.
The following PRO polypeptides tested positive in this assay: PRO844.
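
The positivity rule stated above (control buffer set to 100%, samples above 130% called positive) can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the optional subtraction of the no-cell blank before normalization is an assumption, since the text does not say how the blank wells enter the calculation.

```python
def percent_of_control(sample_od: float, control_od: float, blank_od: float = 0.0) -> float:
    """Express a sample absorbance as a percentage of the control-buffer absorbance.

    blank_od is an assumed, optional correction (wells without cells).
    """
    control = control_od - blank_od
    if control <= 0:
        raise ValueError("control reading must exceed the blank reading")
    return 100.0 * (sample_od - blank_od) / control


def is_apoptosis_positive(sample_od: float, control_od: float,
                          blank_od: float = 0.0, threshold: float = 130.0) -> bool:
    """Return True when the sample level is strictly above the 130% cutoff."""
    return percent_of_control(sample_od, control_od, blank_od) > threshold
```

For example, a sample reading of 0.70 against a control reading of 0.50 is 140% of control and would be called positive, while 0.60 against 0.50 (120%) would not.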

EXAMPLE 167: Guinea Pig Vascular Leak (Assay 32)

This assay is designed to determine whether PRO polypeptides of the present invention show the
ability to induce vascular permeability. Polypeptides testing positive in this assay are expected to be useful
for the therapeutic treatment of conditions which would benefit from enhanced vascular permeability including,
for example, conditions which may benefit from enhanced local immune system cell infiltration.

Hairless guinea pigs weighing 350 grams or more were anesthetized with Ketamine (75-80 mg/kg)
and 5 mg/kg Xylazine intramuscularly. Test samples containing the PRO polypeptide or a physiological buffer
without the test polypeptide are injected into skin on the back of the test animals with 100 µl per injection site
intradermally. There were approximately 16-24 injection sites per animal. One ml of Evans blue dye (1%
in PBS) is then injected intracardially. Skin vascular permeability responses to the compounds (i.e., blemishes
at the sites of injection) are visually scored by measuring the diameter (in mm) of blue-colored leaks
from the site of injection at 1, 6 and 24 hours post administration of the test materials. The mm diameter of
blueness at the site of injection is observed and recorded as well as the severity of the vascular leakage.
Blemishes of at least 5 mm in diameter are considered positive for the assay when testing purified proteins,
being indicative of the ability to induce vascular leakage or permeability. A response greater than 7 mm
diameter is considered positive for conditioned media samples. Human VEGF at 0.1 µg/100 µl is used as a
positive control, inducing a response of 4-8 mm diameter.

The following PRO polypeptides tested positive in this assay: PRO1155.
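
The two diameter cutoffs described in this example (at least 5 mm for purified proteins, greater than 7 mm for conditioned media) can be written as a small scoring helper. The sketch below is illustrative only; the function name and the sample-type labels are hypothetical and are not part of the assay description.

```python
def vascular_leak_positive(diameter_mm: float, sample_type: str) -> bool:
    """Score a blue-dye blemish diameter against the cutoff for its sample type.

    sample_type labels ("purified", "conditioned_media") are illustrative names.
    """
    if diameter_mm < 0:
        raise ValueError("diameter must be non-negative")
    if sample_type == "purified":
        # Purified protein: at least 5 mm counts as positive.
        return diameter_mm >= 5.0
    if sample_type == "conditioned_media":
        # Conditioned media: strictly greater than 7 mm counts as positive.
        return diameter_mm > 7.0
    raise ValueError(f"unknown sample type: {sample_type!r}")
```

Under these rules a 6 mm blemish would be positive for a purified protein but negative for a conditioned media sample.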

EXAMPLE 168: Mouse Mesangial Cell Inhibition Assay (Assay 114)

This assay is designed to determine whether PRO polypeptides of the present invention show the
ability to inhibit the proliferation of mouse mesangial cells in culture. PRO polypeptides testing positive in
this assay would be expected to be useful for the therapeutic treatment of such diseases or conditions where
inhibition of mesangial cell proliferation would be beneficial such as, for example, cystic renal dysplasia,
polycystic kidney disease, or other kidney disease associated with abnormal mesangial cell proliferation, renal
tumors, and the like.

On day 1, mouse mesangial cells are plated on a 96 well plate in growth medium (a 3:1 mixture of
Dulbecco's modified Eagle's medium and Ham's F12 medium, 95%; fetal bovine serum, 5%; supplemented
with 14 mM HEPES) and then are allowed to grow overnight. On day 2, the PRO polypeptide is diluted at 2
different concentrations (1%, 0.1%) in serum-free medium and is added to the cells. The negative control is
growth medium without added PRO polypeptide. After the cells are allowed to incubate for 48 hours, 20 µl
of the Cell Titer 96 Aqueous one solution reagent (Promega) is added to each well and the colorimetric reaction
is allowed to proceed for 2 hours. The absorbance (OD) is then measured at 490 nm. A positive in the assay
is an absorbance reading which is at least 10% above the negative control.

The following PRO polypeptides tested positive in this assay: PRO1192 and PRO1195.

EXAMPLE 169: In Vitro Antitumor Assay (Assay 161)

The antiproliferative activity of various PRO polypeptides was determined in the investigational,
disease-oriented in vitro anti-cancer drug discovery assay of the National Cancer Institute (NCI), using a
sulforhodamine B (SRB) dye binding assay essentially as described by Skehan et al., J. Natl. Cancer Inst.
82:1107-1112 (1990). The 60 tumor cell lines employed in this study ("the NCI panel"), as well as conditions
for their maintenance and culture in vitro have been described by Monks et al., J. Natl. Cancer Inst. 83:757-
766 (1991). The purpose of this screen is to initially evaluate the cytotoxic and/or cytostatic activity of the test
compounds against different types of tumors (Monks et al., supra; Boyd, Cancer: Princ. Pract. Oncol. Update
3(10):1-12 [1989]).

Cells from approximately 60 human tumor cell lines were harvested with trypsin/EDTA (Gibco),
washed once, resuspended in IMEM and their viability was determined. The cell suspensions were added by
pipet (100 µL volume) into separate 96-well microtiter plates. The cell density for the 6-day incubation was
less than for the 2-day incubation to prevent overgrowth. Inoculates were allowed a preincubation period of
24 hours at 37°C for stabilization. Dilutions at twice the intended test concentration were added at time zero
in 100 µL aliquots to the microtiter plate wells (1:2 dilution). Test compounds were evaluated at five half-log
dilutions (1000 to 100,000-fold). Incubations took place for two days and six days in a 5% CO2 atmosphere
and 100% humidity.
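
As a worked illustration of the dilution scheme just described, the following Python sketch lists five half-log dilution factors spanning 1000-fold to 100,000-fold and shows the 1:2 dilution that results from adding a 100 µL aliquot at twice the intended concentration to a well already holding 100 µL. The function names are hypothetical and the code only restates the arithmetic in the text.

```python
def half_log_dilutions(start_fold: float = 1000.0, steps: int = 5) -> list[float]:
    """Return dilution factors spaced by half a log10 unit (about 3.16-fold)."""
    return [start_fold * 10 ** (0.5 * i) for i in range(steps)]


def final_concentration(added_conc: float, added_ul: float = 100.0,
                        well_ul: float = 100.0) -> float:
    """Concentration in the well after mixing the aliquot with the existing volume."""
    return added_conc * added_ul / (added_ul + well_ul)


if __name__ == "__main__":
    for fold in half_log_dilutions():
        print(f"{fold:,.0f}-fold")
    # An aliquot prepared at 2x ends up at 1x in the well.
    print(final_concentration(2.0))
```

The five factors come out at roughly 1,000, 3,162, 10,000, 31,623 and 100,000-fold.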

After incubation, the medium was removed and the cells were fixed in 0.1 ml of 10% trichloroacetic
acid at 40°C. The plates were rinsed five times with deionized water, dried, stained for 30 minutes with 0.1
ml of 0.4% sulforhodamine B dye (Sigma) dissolved in 1% acetic acid, rinsed four times with 1% acetic acid
to remove unbound dye, dried, and the stain was extracted for five minutes with 0.1 ml of 10 mM Tris base
[tris(hydroxymethyl)aminomethane], pH 10.5. The absorbance (OD) of sulforhodamine B at 492 nm was
measured using a computer-interfaced, 96-well microtiter plate reader.

A test sample is considered positive if it shows at least 50% growth inhibitory effect at one or more 
concentrations. The results are shown in the following table, where the abbreviations are as follows: 

NSCL = non-small cell lung carcinoma
CNS = central nervous system 



Table 7

Test compound   Concentration   Days   Tumor Cell Line Type   Cell Line Designation
PRO1016         0.1 nM          2      Leukemia               K-562
PRO1016         0.1 nM          2      Leukemia               MOLT-4
PRO1016         0.1 nM          2      Leukemia               RPMI-8226
PRO1016         0.1 nM          2      NSCL                   A549/ATCC
PRO1016         0.1 nM          2      NSCL                   EKVX
PRO1016         0.1 nM          2      NSCL                   NCI-H23
PRO1016         0.1 nM          2      NSCL                   NCI-H522
PRO1016         0.1 nM          2      Colon                  KM-12
PRO1016         0.1 nM          2      CNS                    SF-295
PRO1016         0.1 nM          2      Melanoma               SK-MEL-5
PRO1016         0.1 nM          2      Melanoma               UACC-257
PRO1016         0.1 nM          2      Ovarian                OVCAR-3
PRO1016         0.1 nM          2      Ovarian                OVCAR-4
PRO1016         0.1 nM          2      Breast                 NCI/ADR-RES
PRO1016         0.1 nM          2      Breast                 T-47D
PRO1016         0.1 nM          6      Leukemia               CCRF-CEM
PRO1016         0.1 nM          6      Leukemia               K-562
PRO1016         0.1 nM          6      Leukemia               MOLT-4
PRO1016         0.1 nM          6      Leukemia               RPMI-8226
PRO1016         0.1 nM          6      NSCL                   A549/ATCC
PRO1016         0.1 nM          6      NSCL                   EKVX
PRO1016         0.1 nM          6      NSCL                   HOP-62
PRO1016         0.1 nM          6      NSCL                   NCI-H23
PRO1016         0.1 nM          6      NSCL                   NCI-H322M
PRO1016         0.1 nM          6      NSCL                   NCI-H460
PRO1016         0.1 nM          6      NSCL                   NCI-H522
PRO1016         0.1 nM          6      Colon                  COLO 205
PRO1016         0.1 nM          6      Colon                  HCT-116
PRO1016         0.1 nM          6      Colon                  HCT-15
PRO1016         0.1 nM          6      Colon                  HT-29
PRO1016         0.1 nM          6      Colon                  SW-620
PRO1016         0.1 nM          6      CNS                    SF-295
PRO1016         0.1 nM          6      CNS                    SF-539
PRO1016         0.1 nM          6      CNS                    SNB-19
PRO1016         0.1 nM          6      CNS                    U251
PRO1016         0.1 nM          6      Melanoma               LOX IMVI
PRO1016         0.1 nM          6      Melanoma               MALME-3M
PRO1016         0.1 nM          6      Melanoma               SK-MEL-28
PRO1016         0.1 nM          6      Melanoma               SK-MEL-5
PRO1016         0.1 nM          6      Melanoma               UACC-257
PRO1016         0.1 nM          6      Melanoma               UACC-62
PRO1016         0.1 nM          6      Ovarian                IGROV1
PRO1016         0.1 nM          6      Ovarian                OVCAR-3
PRO1016         0.1 nM          6      Ovarian                OVCAR-4
PRO1016         0.1 nM          6      Ovarian                OVCAR-8
PRO1016         0.1 nM          6      Renal                  ACHN
PRO1016         0.1 nM          6      Renal                  RXF393
PRO1016         0.1 nM          6      Renal                  SN12C
PRO1016         0.1 nM          6      Renal                  TK-10
PRO1016         0.1 nM          6      Prostate               PC-3
PRO1016         0.1 nM          6      Breast                 MCF-7
PRO1016         0.1 nM          6      Breast                 NCI/ADR-RES
PRO1016         0.1 nM          6      Breast                 MDA-MB-231
PRO1016         0.1 nM          6      Breast                 MDA-MB-435
PRO1016         0.1 nM          6      Breast                 MDA-N
PRO1016         0.1 nM          6      Breast                 BT-549
PRO1016         0.1 nM          6      Breast                 T-47D
PRO1186         95 nM           2      NSCL                   NCI-H226
PRO1186         95 nM           2      Colon                  Colo205
PRO1186         2.2 nM          6      Breast                 MDA-N
PRO1186         114 nM          2      NSCL                   NCI-H322M
PRO1186         114 nM          2      CNS                    SF-268; SF-539
PRO1186         114 nM          2      Ovarian                IGROV1
PRO1186         114 nM          2      Renal                  786-0; SN12C; TK-10
PRO1186         114 nM          6      Leukemia               MOLT-4; RPMI-8226
PRO1186         114 nM          6      Melanoma               LOX IMVI
PRO1186         114 nM          6      Ovarian                OVCAR-4; SK-OV-
PRO1186         114 nM          6      Breast                 MDA-MB-435; T-47D
PRO1186         8.1 nM          6      Leukemia               K-562
PRO1186         8.1 nM          6      NSCL                   HOP-62
PRO1186         8.1 nM          6      Colon                  Colo205; HCC-2998
PRO1186         8.1 nM          6      Breast                 T-47D
PRO1186         15.4 nM         6      Leukemia               K-562
PRO1186         3.6 nM          2      Ovarian                OVCAR-3
PRO1186         3.6 nM          6      NSCL                   HOP-62



The results of these assays demonstrate that the positive testing PRO polypeptides are useful for
inhibiting neoplastic growth in a number of different tumor cell types and may be used therapeutically
therefor. Antibodies against these PRO polypeptides are useful for affinity purification of these useful
polypeptides. Nucleic acids encoding these PRO polypeptides are useful for the recombinant preparation of
these polypeptides.

EXAMPLE 170 : Gene Amplification in Tumors 

This example shows that certain PRO polypeptide-encoding genes are amplified in the genome of
certain human lung, colon and/or breast cancers and/or cell lines. Amplification is associated with
overexpression of the gene product, indicating that the polypeptides are useful targets for therapeutic
intervention in certain cancers such as colon, lung, breast and other cancers and diagnostic determination of
the presence of those cancers. Therapeutic agents may take the form of antagonists of the PRO polypeptide,
for example, murine-human chimeric, humanized or human antibodies against a PRO polypeptide.

The starting material for the screen was genomic DNA isolated from a variety of cancers. The DNA
is quantitated precisely, e.g., fluorometrically. As a negative control, DNA was isolated from the cells of ten
normal healthy individuals which was pooled and used as assay controls for the gene copy in healthy
individuals (not shown). The 5' nuclease assay (for example, TaqMan™) and real-time quantitative PCR (for
example, ABI Prism 7700 Sequence Detection System™ (Perkin Elmer, Applied Biosystems Division, Foster
City, CA)), were used to find genes potentially amplified in certain cancers. The results were used to
determine whether the DNA encoding the PRO polypeptide is over-represented in any of the primary lung or
colon cancers or cancer cell lines or breast cancer cell lines that were screened. The primary lung cancers
were obtained from individuals with tumors of the type and stage as indicated in Table 8. An explanation of
the abbreviations used for the designation of the primary tumors listed in Table 8 and the primary tumors and
cell lines referred to throughout this example is given below.

The results of the TaqMan™ are reported in delta (Δ) Ct units. One unit corresponds to 1 PCR cycle
or approximately a 2-fold amplification relative to normal, two units corresponds to 4-fold, 3 units to 8-fold
amplification and so on. Quantitation was obtained using primers and a TaqMan™ fluorescent probe derived
from the PRO polypeptide-encoding gene. Regions of the PRO polypeptide-encoding gene which are most
likely to contain unique nucleic acid sequences and which are least likely to have spliced out introns are
preferred for the primer and probe derivation, e.g., 3'-untranslated regions. The sequences for the primers
and probes (forward, reverse and probe) used for the PRO polypeptide gene amplification analysis were as
follows:

PRO290 (DNA35680-1212):
35680.tm.p:
5'-CCACCAATGGCAGCCCCACCT-3' (SEQ ID NO:428)
35680.tm.f:
5'-GACTGCCCTCCCTGCCA-3' (SEQ ID NO:429)
35680.tm.r:
5'-CAAAAAGCCTGCAAGTCTTCAAAG-3' (SEQ ID NO:430)

PRO341 (DNA26288-1239):
26288.tm.f1:
5'-CAGCTGGACTGCAGGTGCTA-3' (SEQ ID NO:431)
26288.tm.r1:
5'-CAGTGAGCACAGCAAGTGTCCT-3' (SEQ ID NO:432)
26288.tm.p1:
5'-GGCCACCTCCTTGAGTCTTCAGTTCCCT-3' (SEQ ID NO:433)

PRO535 (DNA49143-1429):
49143.tm.f1:
5'-CAACTACTGGCTAAAGCTGGTGAA-3' (SEQ ID NO:434)
49143.tm.r1:
5'-CCTTTCTGTATAGGTGATACCCAATGA-3' (SEQ ID NO:435)
49143.tm.p1:
5'-TGGCCATCCCTACCAGAGGCAAAA-3' (SEQ ID NO:436)

PRO619 (DNA49821-1562):
49821.tm.f1:
5'-CTGAAGACGACGCGGATTACTA-3' (SEQ ID NO:437)
49821.tm.r1:
5'-GGCAGAAATGGGAGGCAGA-3' (SEQ ID NO:438)
49821.tm.p1:
5'-TGCTCTGTTGGCTACGGCnTAGTCCCTAG-3' (SEQ ID NO:439)

PRO809 (DNA57836-1338):
57836.tm.f1:
5'-AGCAGCAGCCATGTAGAATGAA-3' (SEQ ID NO:440)
57836.tm.r1:
5'-AATACGAACAGTGCACGCTGAT-3' (SEQ ID NO:441)
57836.tm.p1:
5'-TCCAGAGAGCCAAGCACGGCAGA-3' (SEQ ID NO:442)

PRO830 (DNA56866-1342):
56866.tm.f1:
5'-TCTAGCCAGCTTGGCTCCAATA-3' (SEQ ID NO:443)
56866.tm.r1:
5'-CCTGGCTCTAGCACCAACTCATA-3' (SEQ ID NO:444)
56866.tm.p1:
5'-TCAGTGGCCCTAAGGAGATGGGCCT-3' (SEQ ID NO:445)

PRO848 (DNA59839-1461):
59839.tm.f1:
5'-CAGGATACAGTGGGAATCTTGAGA-3' (SEQ ID NO:446)
59839.tm.r1:
5'-CCTGAAGGGCTTGGAGCTTAGT-3' (SEQ ID NO:447)
59839.tm.p1:
5'-TCTTTGGCCATTTCCCATGGCTCA-3' (SEQ ID NO:448)

PRO943 (DNA52192-1369):
52192.tm.f1:
5'-CCCATGGCGAGGAGGAAT-3' (SEQ ID NO:449)
52192.tm.r1:
5'-TGCGTACGTGTGCCTTCAG-3' (SEQ ID NO:450)
52192.tm.p1:
5'-CAGCACCCCAGGCAGTCTGTGTGT-3' (SEQ ID NO:451)

PRO1005 (DNA57708-1411):
57708.tm.f1:
5'-AACGTGCTACACGACCAGTGTACT-3' (SEQ ID NO:452)
57708.tm.r1:
5'-CACAGCATATTCAGATGACTAAATCCA-3' (SEQ ID NO:453)
57708.tm.p1:
5'-TTGTTTAGTTCTCCACCGTGTCTCCACAGAA-3' (SEQ ID NO:454)

PRO1009 (DNA57129-1413):
57129.tm.f1:
5'-TGTCAGAATGCAACCTGGCTT-3' (SEQ ID NO:455)
57129.tm.r1:
5'-TGATGTGCCTGGCTCAGAAC-3' (SEQ ID NO:456)
57129.tm.p1:
5'-TGCACCTAGATGTCCCCAGCACCC-3' (SEQ ID NO:457)

PRO1097 (DNA59841-1460):
59841.tm.f1:
5'-AAGATGCGCCAGGCTTCTTA-3' (SEQ ID NO:458)
59841.tm.r1:
5'-CTCCTGTACGGTCTGCTCACTTAT-3' (SEQ ID NO:459)
59841.tm.p1:
5'-TGGCTGTCAGTCCAGTGTGCATGG-3' (SEQ ID NO:460)

PRO1107 (DNA59606-1471):
59606.tm.f1:
5'-GCATAGGGATAGATAAGATCCTGCTTTAT-3' (SEQ ID NO:461)
59606.tm.r1:
5'-CAAATTAAAGTACCCATCAGGAGAGAA-3' (SEQ ID NO:462)
59606.tm.p1:
5'-AAGTTGCTAAATATATACATTATCTGCGCCAAGTCCA-3' (SEQ ID NO:463)

PRO1111 (DNA58721-1475):
58721.tm.f1:
5'-GTGCTGCCCACAATTCATGA-3' (SEQ ID NO:464)
58721.tm.r1:
5'-GTCCTTGGTATGGGTCTGAATTATAT-3' (SEQ ID NO:465)
58721.tm.p1:
5'-ACTCTCTGCACCCCACAGTCACCACTATCTC-3' (SEQ ID NO:466)

PRO1153 (DNA59842-1502):
59842.tm.f1:
5'-CTGAGGAACCAGCCATGTCTCT-3' (SEQ ID NO:467)
59842.tm.r1:
5'-GACCAGATGCAGGTACAGGATGA-3' (SEQ ID NO:468)



542 



wo 00/73454 




PCT/USOO/08439 



59842.tm.p1:
5'-CrGCCCCTTCAGTGATGCCAACCTT-3' (SEQ ID NO:469)

PRO1182 (DNA59848-1512):
59848.tm.f1:
5'-GGGTGGAGGCTCACTGAGTAGA-3' (SEQ ID NO:470)
59848.tm.r1:
5'-CAATACAGGTAATGAAACTCTGCTTCTT-3' (SEQ ID NO:471)
59848.tm.p1:
5'-TCCTCTTAAGCATAGGCCATTTTCTCAGTTTAGACA-3' (SEQ ID NO:472)

PRO1184 (DNA59220-1514):
59220.tm.f1:
5'-GGTGGTCTTGCTTGGTCTCAC-3' (SEQ ID NO:473)
59220.tm.r1:
5'-CCGTCGTTCAGCAACATGAC-3' (SEQ ID NO:474)
59220.tm.p1:
5'-ACCGCCTACCGCTGTGCCCA-3' (SEQ ID NO:475)

PRO1187 (DNA62876-1517):
62876.tm.f1:
5'-CAGTAAAACCACAGGCTGGATTT-3' (SEQ ID NO:476)
62876.tm.r1:
5'-CCTGAGAGCAAGAAGGTTGAGAAT-3' (SEQ ID NO:477)
62876.tm.p1:
5'-TAGACAGGGACCATGGCCCGCA-3' (SEQ ID NO:478)

PRO1281 (DNA59820-1549):
59820.tm.f1:
5'-TGGGCTGTAGAAGAGTTGTTG-3' (SEQ ID NO:479)
59820.tm.r1:
5'-TCCACACTTGGCCAGTTTAT-3' (SEQ ID NO:480)
59820.tm.p1:
5'-CCCAACTTCTCCCTnTGGACCCT-3' (SEQ ID NO:481)

PRO1112 (DNA57702-1476):
57702.tm.f1:
5'-GTCCCTTCACTGTTTAGAGCATGA-3' (SEQ ID NO:482)
57702.tm.p1:
5'-ACTCTCCCCCTCAACAC}CCTCCTGAG-3' (SEQ ID NO:483)
57702.tm.r1:
5'-GTGGTCAGGGCAGATCCTTT-3' (SEQ ID NO:484)

PRO1185 (DNA62881-1515):
62881.tm.f1:
5'-ACAGATCCAGGAGAGACTCCACA-3' (SEQ ID NO:485)
62881.tm.p1:
5'-AGCGGCGCTCCCAGCCTGAAT-3' (SEQ ID NO:486)
62881.tm.r1:
5'-CATGATTGGTCCTCAGTTCCATC-3' (SEQ ID NO:487)

PRO1245 (DNA64884-1527):
64884.tm.f1:
5'-ATAGAGGGCTCCCAGAAGTG-3' (SEQ ID NO:488)
64884.tm.p1:
5'-CAGGGCCTTCAGGGCCTTCAC-3' (SEQ ID NO:489)
64884.tm.r1:
5'-GCTCA(X;CAAACACTGTCA-3' (SEQ ID NO:490)
64884.tm.f2:
5'-GGGGCCCTGACAGTGTT-3' (SEQ ID NO:491)
64884.tm.p2:
5'-CTGAGCCGAGACTGGAGCATCTACAC-3' (SEQ ID NO:492)
64884.tm.r2:
5'-GTGGGCAGCGTCTTGTC-3' (SEQ ID NO:493)

The 5' nuclease assay reaction is a fluorescent PCR-based technique which makes use of the 5'
exonuclease activity of Taq DNA polymerase enzyme to monitor amplification in real time. Two
oligonucleotide primers (forward [.f] and reverse [.r]) are used to generate an amplicon typical of a PCR
reaction. A third oligonucleotide, or probe (.p), is designed to detect nucleotide sequence located between the
two PCR primers. The probe is non-extendible by Taq DNA polymerase enzyme, and is labeled with a
reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced emission from the reporter dye
is quenched by the quenching dye when the two dyes are located close together as they are on the probe.
During the amplification reaction, the Taq DNA polymerase enzyme cleaves the probe in a template-dependent
manner. The resultant probe fragments disassociate in solution, and signal from the released reporter dye is
free from the quenching effect of the second fluorophore. One molecule of reporter dye is liberated for each
new molecule synthesized, and detection of the unquenched reporter dye provides the basis for quantitative
interpretation of the data.

The 5' nuclease procedure is run on a real-time quantitative PCR device such as the ABI Prism
7700™ Sequence Detection System. The system consists of a thermocycler, laser, charge-coupled device (CCD)
camera and computer. The system amplifies samples in a 96-well format on a thermocycler. During
amplification, laser-induced fluorescent signal is collected in real-time through fiber optics cables for all 96
wells, and detected at the CCD. The system includes software for running the instrument and for analyzing
the data.

5' Nuclease assay data are initially expressed as Ct, or the threshold cycle. This is defined as the
cycle at which the reporter signal accumulates above the background level of fluorescence. The ΔCt values
are used as quantitative measurement of the relative number of starting copies of a particular target sequence
in a nucleic acid sample when comparing cancer DNA results to normal human DNA results.
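
Because one ΔCt unit is described in this example as one PCR cycle, or roughly a 2-fold amplification relative to normal, the conversion from ΔCt to approximate fold amplification is a power of two. The Python sketch below illustrates that arithmetic; the function names are hypothetical, and the sign convention (normal Ct minus tumor Ct, so that a positive ΔCt means more starting copies in the tumor sample) is an assumption made for the illustration.

```python
def delta_ct(ct_normal: float, ct_tumor: float) -> float:
    """Assumed convention: positive when the tumor sample crosses threshold earlier."""
    return ct_normal - ct_tumor


def fold_amplification(dct: float) -> float:
    """Approximate relative copy number: each ΔCt unit is about a 2-fold change."""
    return 2.0 ** dct


if __name__ == "__main__":
    for units in (1, 2, 3):
        print(units, "ΔCt unit(s) ->", fold_amplification(units), "-fold")
```

This reproduces the correspondence given in the text: 1 unit is about 2-fold, 2 units about 4-fold, and 3 units about 8-fold.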

Table 8 describes the stage, T stage and N stage of various primary tumors which were used to screen
the PRO polypeptide compounds of the invention.

Table 8

Primary Lung and Colon Tumor Profiles

Primary Tumor                                     Stage   Other Stage   Dukes Stage   T Stage   N Stage
Human lung tumor AdenoCa (SRCC724) [LT1]          IIA                                 T1        N1
Human lung tumor SqCCa (SRCC725) [LT1a]           IIB                                 T3        N0
Human lung tumor AdenoCa (SRCC726) [LT2]          IB                                  T2        N0
Human lung tumor AdenoCa (SRCC727) [LT3]          IIIA                                T1        N2
Human lung tumor AdenoCa (SRCC728) [LT4]          IB                                  T2        N0
Human lung tumor SqCCa (SRCC729) [LT6]            IB                                  T2        N0
Human lung tumor Aden/SqCCa (SRCC730) [LT7]       IA                                  T1        N0
Human lung tumor AdenoCa (SRCC731) [LT9]          IB                                  T2        N0
Human lung tumor SqCCa (SRCC732) [LT10]           IIB                                 T2        N1
Human lung tumor SqCCa (SRCC733) [LT11]           IIA                                 T1        N1
Human lung tumor AdenoCa (SRCC734) [LT12]         IV                                  T2        N0
Human lung tumor AdenoSqCCa (SRCC735) [LT13]      IB                                  T2        N0
Human lung tumor SqCCa (SRCC736) [LT15]           IB                                  T2        N0
Human lung tumor SqCCa (SRCC737) [LT16]           IB                                  T2        N0
Human lung tumor SqCCa (SRCC738) [LT17]           IIB                                 T2        N1
Human lung tumor SqCCa (SRCC739) [LT18]           IB                                  T2        N0
Human lung tumor SqCCa (SRCC740) [LT19]           IB                                  T2        N0
Human lung tumor LCCa (SRCC741) [LT21]            IIB                                 T3        N1
Human lung AdenoCa (SRCC811) [LT22]               IA                                  T1        N0
Human colon AdenoCa (SRCC742) [CT2]                       M1            D             pT4       N0
Human colon AdenoCa (SRCC743) [CT3]                                     B             pT3       N0
Human colon AdenoCa (SRCC744) [CT8]                                     B             T3        N0
Human colon AdenoCa (SRCC745) [CT10]                                    A             pT2       N0
Human colon AdenoCa (SRCC746) [CT12]                      M0, R1        B             T3        N0
Human colon AdenoCa (SRCC747) [CT14]                      pM0, R0       B             pT3       pN0
Human colon AdenoCa (SRCC748) [CT15]                      M1, R2        D             T4        N2
Human colon AdenoCa (SRCC749) [CT16]                      pM0           B             pT3       pN0
Human colon AdenoCa (SRCC750) [CT17]                                    C1            pT3       pN1
Human colon AdenoCa (SRCC751) [CT1]                       M0, R1        B             pT3       N0
Human colon AdenoCa (SRCC752) [CT4]                                     B             pT3       M0
Human colon AdenoCa (SRCC753) [CT5]                       G2            C1            pT3       pN0
Human colon AdenoCa (SRCC754) [CT6]                       pM0, R0       B             pT3       pN0
Human colon AdenoCa (SRCC755) [CT7]                       G1            A             pT2       pN0
Human colon AdenoCa (SRCC756) [CT9]                       G3            D             pT4       pN2
Human colon AdenoCa (SRCC757) [CT11]                                    B             T3        N0
Human colon AdenoCa (SRCC758) [CT18]                      M0, R0        B             pT3       pN0



DNA Preparation : 

DNA was prepared from cultured cell lines, primary tumors, and normal human blood. The isolation was
performed using a purification kit, buffer set and protease, all from Qiagen, according to the manufacturer's
instructions and the description below.

Cell culture lysis: 

Cells were washed and trypsinized at a concentration of 7.5 x 10^ per tip and pelleted by centrifuging
at 1000 rpm for 5 minutes at 4°C, followed by washing again with 1/2 volume of PBS and recentrifugation. The
pellets were washed a third time, the suspended cells collected and washed 2x with PBS. The cells were then
suspended into 10 ml PBS. Buffer C1 was equilibrated at 4°C. Qiagen protease #19155 was diluted into 6.25
ml cold ddH2O to a final concentration of 20 mg/ml and equilibrated at 4°C. 10 ml of G2 Buffer was prepared
by diluting Qiagen RNAse A stock (100 mg/ml) to a final concentration of 200 µg/ml.

Buffer C1 (10 ml, 4°C) and ddH2O (40 ml, 4°C) were then added to the 10 ml of cell suspension,
mixed by inverting and incubated on ice for 10 minutes. The cell nuclei were pelleted by centrifuging in a
Beckman swinging bucket rotor at 2500 rpm at 4°C for 15 minutes. The supernatant was discarded and the
nuclei were suspended with a vortex into 2 ml Buffer C1 (at 4°C) and 6 ml ddH2O, followed by a second 4°C
centrifugation at 2500 rpm for 15 minutes. The nuclei were then resuspended into the residual buffer using
200 µl per tip. G2 buffer (10 ml) was added to the suspended nuclei while gentle vortexing was applied.
Upon completion of buffer addition, vigorous vortexing was applied for 30 seconds. Qiagen protease (200
µl, prepared as indicated above) was added and incubated at 50°C for 60 minutes. The incubation and
centrifugation was repeated until the lysates were clear (e.g., incubating additional 30-60 minutes, pelleting
at 3000 x g for 10 min., 4°C).

Solid human tumor sample preparation and lysis: 

Tumor samples were weighed and placed into 50 ml conical tubes and held on ice. Processing was
limited to no more than 250 mg tissue per preparation (1 tip/preparation). The protease solution was freshly
prepared by diluting into 6.25 ml cold ddH2O to a final concentration of 20 mg/ml and stored at 4°C. G2
buffer (20 ml) was prepared by diluting RNAse A to a final concentration of 200 µg/ml (from 100 mg/ml
stock). The tumor tissue was homogenized in 19 ml G2 buffer for 60 seconds using the large tip of the
polytron in a laminar-flow TC hood in order to avoid inhalation of aerosols, and held at room temperature.
Between samples, the polytron was cleaned by spinning at 2 x 30 seconds each in 2L ddH2O, followed by G2
buffer (50 ml). If tissue was still present on the generator tip, the apparatus was disassembled and cleaned.

Qiagen protease (prepared as indicated above, 1.0 ml) was added, followed by vortexing and
incubation at 50°C for 3 hours. The incubation and centrifugation was repeated until the lysates were clear
(e.g., incubating additional 30-60 minutes, pelleting at 3000 x g for 10 min., 4°C).

Human blood preparation and lysis:

Blood was drawn from healthy volunteers using standard infectious agent protocols and citrated into
10 ml samples per tip. Qiagen protease was freshly prepared by dilution into 6.25 ml cold ddH2O to a final
concentration of 20 mg/ml and stored at 4°C. G2 buffer was prepared by diluting RNAse A to a final
concentration of 200 µg/ml from 100 mg/ml stock. The blood (10 ml) was placed into a 50 ml conical tube
and 10 ml C1 buffer and 30 ml ddH2O (both previously equilibrated to 4°C) were added, and the components
mixed by inverting and held on ice for 10 minutes. The nuclei were pelleted with a Beckman swinging bucket
rotor at 2500 rpm, 4°C for 15 minutes and the supernatant discarded. With a vortex, the nuclei were
suspended into 2 ml C1 buffer (4°C) and 6 ml ddH2O (4°C). Vortexing was repeated until the pellet was white.
The nuclei were then suspended into the residual buffer using a 200 µl tip. G2 buffer (10 ml) was added to
the suspended nuclei while gently vortexing, followed by vigorous vortexing for 30 seconds. Qiagen protease
was added (200 µl) and incubated at 50°C for 60 minutes. The incubation and centrifugation was repeated until
the lysates were clear (e.g., incubating additional 30-60 minutes, pelleting at 3000 x g for 10 min., 4°C).

Purification of cleared lysates:

(1) Isolation of genomic DNA : 

Genomic DNA was equilibrated (1 sample per maxi tip preparation) with 10 ml QBT buffer. QF
elution buffer was equilibrated at 50°C. The samples were vortexed for 30 seconds, then loaded onto
equilibrated tips and drained by gravity. The tips were washed with 2 x 15 ml QC buffer. The DNA was
eluted into silanized, autoclaved 30 ml Corex tubes with 15 ml QF buffer (50°C). Isopropanol (10.5
ml) was added to each sample, the tubes covered with parafilm and mixed by repeated inversion until the DNA
precipitated. Samples were pelleted by centrifugation in the SS-34 rotor at 15,000 rpm for 10 minutes at 4°C.
The pellet location was marked, the supernatant discarded, and 10 ml 70% ethanol (4°C) was added. Samples
were pelleted again by centrifugation on the SS-34 rotor at 10,000 rpm for 10 minutes at 4°C. The pellet
location was marked and the supernatant discarded. The tubes were then placed on their side in a drying rack
and dried 10 minutes at 31°C, taking care not to overdry the samples.

After drying, the pellets were dissolved into 1.0 ml TE (pH 8.5) and placed at 50°C for 1-2 hours.
Samples were held overnight at 4°C as dissolution continued. The DNA solution was then transferred to 1.5
ml tubes with a 26 gauge needle on a tuberculin syringe. The transfer was repeated 5x in order to shear the
DNA. Samples were then placed at 50°C for 1-2 hours.

(2) Quantitation of genomic DNA and preparation for gene amplification assay : 

The DNA levels in each tube were quantified by standard A260, A280 spectrophotometry on a 1:20
dilution (5 µl DNA + 95 µl ddH2O) using the 0.1 ml quartz cuvettes in the Beckman DU640
spectrophotometer. A260/A280 ratios were in the range of 1.8-1.9. Each DNA sample was then diluted further
to approximately 200 ng/ml in TE (pH 8.5). If the original material was highly concentrated (about 700
ng/µl), the material was placed at 50°C for several hours until resuspended.
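
By way of illustration only, the spectrophotometric estimate described above may be sketched as follows. The sketch assumes the common conversion of one A260 unit to about 50 ng/µl of double-stranded DNA at a 1 cm path length; that factor, the path length and the function names are assumptions of this sketch and are not stated in the protocol.

```python
# Assumption (not from the protocol): 1 A260 unit ~ 50 ng/ul dsDNA, 1 cm path length.
DSDNA_NG_PER_UL_PER_A260 = 50.0


def dna_conc_from_a260(a260, dilution_factor=20.0):
    """Estimate stock DNA concentration (ng/ul) from the A260 of a diluted aliquot.

    dilution_factor=20 corresponds to the 1:20 dilution (5 ul DNA + 95 ul water).
    """
    return a260 * DSDNA_NG_PER_UL_PER_A260 * dilution_factor


def ratio_in_reported_range(a260, a280, low=1.8, high=1.9):
    """Return True if A260/A280 falls in the 1.8-1.9 range reported in the text."""
    return low <= a260 / a280 <= high


if __name__ == "__main__":
    # An A260 of 0.7 on the 1:20 dilution corresponds to about 700 ng/ul stock.
    print(dna_conc_from_a260(0.7))
    print(ratio_in_reported_range(0.74, 0.40))
```

Under these assumptions, an A260 reading of 0.7 on the 1:20 dilution corresponds to a stock of about 700 ng/µl, the level the text describes as highly concentrated.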

Fluorometric DNA quantitation was then performed on the diluted material (20-600 ng/ml) using the
manufacturer's guidelines as modified below. This was accomplished by allowing a Hoeffer DyNA Quant 200
fluorometer to warm-up for about 15 minutes. The Hoechst dye working solution (#H33258, 10 µl, prepared
within 12 hours of use) was diluted into 100 ml 1 x TNE buffer. A 2 ml cuvette was filled with the
fluorometer solution, placed into the machine, and the machine was zeroed. pGEM 3Zf(+) (2 µl, lot
#360851026) was added to 2 ml of fluorometer solution and calibrated at 200 units. An additional 2 µl of
pGEM 3Zf(+) DNA was then tested and the reading confirmed at 400 +/- 10 units. Each sample was then
read at least in triplicate. When 3 samples were found to be within 10% of each other, their average was taken
and this value was used as the quantification value.
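
The acceptance rule for replicate fluorometer readings may be sketched as follows. The text does not define "within 10% of each other"; this sketch assumes it means that the spread (maximum minus minimum) of three readings does not exceed 10% of the smallest reading. The function names are hypothetical.

```python
from itertools import combinations


def within_tolerance(readings, tol=0.10):
    """Assumed criterion: spread of the readings is at most tol times the smallest one."""
    return (max(readings) - min(readings)) <= tol * min(readings)


def quantification_value(readings, tol=0.10):
    """Return the mean of the first three readings that agree within tol, else None."""
    for trio in combinations(readings, 3):
        if within_tolerance(trio, tol):
            return sum(trio) / 3
    return None


if __name__ == "__main__":
    # Four readings; one outlier (260) is ignored once three readings agree.
    print(quantification_value([200, 260, 205, 210]))
```

A return value of `None` indicates that no three readings agree and that further readings would be needed.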

The fluorometrically determined concentration was then used to dilute each sample to 10 ng/µl in
ddH2O. This was done simultaneously on all template samples for a single TaqMan plate assay, and with
enough material to run 500-1000 assays. The samples were tested in triplicate with Taqman™ primers and
probes for both β-actin and GAPDH on a single plate with normal human DNA and no-template controls. The
diluted samples were used provided that the CT value of normal human DNA subtracted from test DNA was
+/- 1 Ct. The diluted, lot-qualified genomic DNA was stored in 1.0 ml aliquots at -80°C. Aliquots which
were subsequently to be used in the gene amplification assay were stored at 4°C. Each 1 ml aliquot is enough
for 8-9 plates or 64 tests.
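
As a minimal sketch of the two calculations in this step, the dilution to 10 ng/µl follows from C1·V1 = C2·V2, and lot qualification compares the Ct of the test DNA with that of normal human DNA. The function names and the example numbers are hypothetical; only the 10 ng/µl target and the +/- 1 Ct window come from the text.

```python
def dilution_volumes(stock_ng_per_ul, final_volume_ul, target_ng_per_ul=10.0):
    """Return (stock volume, water volume) in ul needed to reach the target concentration."""
    if stock_ng_per_ul < target_ng_per_ul:
        raise ValueError("stock is already more dilute than the target")
    stock_ul = target_ng_per_ul * final_volume_ul / stock_ng_per_ul
    return stock_ul, final_volume_ul - stock_ul


def lot_qualifies(test_ct, normal_ct, max_abs_delta=1.0):
    """A diluted sample is used only if (test Ct - normal Ct) lies within +/- 1 Ct."""
    return abs(test_ct - normal_ct) <= max_abs_delta


if __name__ == "__main__":
    # A 200 ng/ul stock diluted to 10 ng/ul in a 1 ml aliquot.
    print(dilution_volumes(200.0, 1000.0))
    print(lot_qualifies(25.4, 25.0))
```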

Gene amplification assay:

The PRO polypeptide compounds of the invention were screened in the following primary tumors and
the resulting ΔCt values greater than or equal to 1.0 are reported in Tables 9A-C below.

Table 9C

ΔCt values in lung and colon primary tumors and cell line models

Primary Tumor  PRO1182  PRO1184  PRO1187  PRO1281

LT-1 1.81 — — —

LT-1a — 1.14 — —

1.09

LT4 1.43 1.37 — —

1.18

LT6 — 1.78 — —

1.66
1.05

LT9 1.43 — —

LT12 — 2.47 1.17 —

2.61

1.80

LT15 — — 1.55 —

LT16 — 1.01 1.33 —

LT17 — —

LT18 — 1.07 — —

1.13

LT19 — 1.19 — —

1.35
1.02

LT21 — 1.00

1.20

CT2 — — — 1.15

CT12 — — — 1.07

Because amplification of the various DNAs described above occurs in various cancerous tumors and
tumor cell lines derived from various human tissues, these molecules likely play a significant role in tumor
formation and/or growth. As a result, amplification and/or enhanced expression of these molecules can serve
as a diagnostic for detecting the presence of tumor in an individual and antagonists (e.g., antibodies) directed
against the proteins encoded by the above described DNA molecules would be expected to have utility in cancer
therapy.
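
The reporting threshold used for the tables above can be sketched as a simple filter. The conversion of ΔCt to an approximate fold amplification (2 raised to the power ΔCt) assumes ideal doubling at every PCR cycle; that conversion is an assumption of this sketch, and the tumor labels and values in the example are hypothetical.

```python
def reportable(delta_ct_by_tumor, threshold=1.0):
    """Keep only tumors whose delta-Ct meets the reporting threshold (>= 1.0)."""
    return {tumor: d for tumor, d in delta_ct_by_tumor.items() if d >= threshold}


def approx_fold_amplification(delta_ct):
    """Assumes ideal PCR efficiency: one Ct unit corresponds to about a 2-fold change."""
    return 2.0 ** delta_ct


if __name__ == "__main__":
    hypothetical = {"tumor-A": 1.81, "tumor-B": 0.40, "tumor-C": 1.00}
    for tumor, d in reportable(hypothetical).items():
        print(tumor, d, round(approx_fold_amplification(d), 2))
```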

EXAMPLE 171: Identification of Receptor/Ligand Interactions

In this assay, various PRO polypeptides are tested for ability to bind to a panel of potential receptor
molecules for the purpose of identifying receptor/ligand interactions. The identification of a ligand for a
known receptor, a receptor for a known ligand or a novel receptor/ligand pair is useful for a variety of
indications including, for example, targeting bioactive molecules (linked to the ligand or receptor) to a cell
known to express the receptor or ligand, use of the receptor or ligand as a reagent to detect the presence of the
ligand or receptor in a composition suspected of containing the same, wherein the composition may comprise
cells suspected of expressing the ligand or receptor, modulating the growth of or another biological or
immunological activity of a cell known to express or respond to the receptor or ligand, modulating the immune
response of cells or toward cells that express the receptor or ligand, allowing the preparation of agonists,
antagonists and/or antibodies directed against the receptor or ligand which will modulate the growth of or a
biological or immunological activity of a cell expressing the receptor or ligand, and various other indications
which will be readily apparent to the ordinarily skilled artisan.

The assay is performed as follows. A PRO polypeptide of the present invention suspected of being
a ligand for a receptor is expressed as a fusion protein containing the Fc domain of human IgG (an
immunoadhesin). Receptor-ligand binding is detected by allowing interaction of the immunoadhesin
polypeptide with cells (e.g. Cos cells) expressing candidate PRO polypeptide receptors and visualization of
bound immunoadhesin with fluorescent reagents directed toward the Fc fusion domain and examination by
microscope. Cells expressing candidate receptors are produced by transient transfection, in parallel, of defined
subsets of a library of cDNA expression vectors encoding PRO polypeptides that may function as receptor
molecules. Cells are then incubated for 1 hour in the presence of the PRO polypeptide immunoadhesin being
tested for possible receptor binding. The cells are then washed and fixed with paraformaldehyde. The cells
are then incubated with fluorescent conjugated antibody directed against the Fc portion of the PRO polypeptide
immunoadhesin (e.g. FITC conjugated goat anti-human-Fc antibody). The cells are then washed again and
examined by microscope. A positive interaction is judged by the presence of fluorescent labeling of cells
transfected with cDNA encoding a particular PRO polypeptide receptor or pool of receptors and an absence
of similar fluorescent labeling of similarly prepared cells that have been transfected with other cDNA or pools
of cDNA. If a defined pool of cDNA expression vectors is judged to be positive for interaction with a PRO
polypeptide immunoadhesin, the individual cDNA species that comprise the pool are tested individually (the
pool is "broken down") to determine the specific cDNA that encodes a receptor able to interact with the PRO
polypeptide immunoadhesin.
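
The pool "break down" logic described above may be sketched as a two-stage screen. In this sketch the callable `binds` stands in for the wet-lab readout (fluorescent labeling of transfected cells); it, the pool names and the cDNA labels are hypothetical placeholders and not part of the described assay.

```python
def deconvolute(pools, binds):
    """Screen pools first, then test each member of every positive pool individually.

    pools: dict mapping a pool name to a list of cDNA identifiers.
    binds: callable taking a list of cDNA identifiers and returning True when
           cells transfected with that set show labeling (placeholder for the assay).
    Returns a list of (pool name, cDNA identifier) pairs judged positive.
    """
    hits = []
    for pool_name, members in pools.items():
        if not binds(members):
            continue  # negative pool: its members are not tested individually
        for cdna in members:
            if binds([cdna]):
                hits.append((pool_name, cdna))
    return hits


if __name__ == "__main__":
    true_receptors = {"cDNA-7"}  # hypothetical ground truth for the demonstration

    def simulated_assay(cdnas):
        return any(c in true_receptors for c in cdnas)

    example_pools = {"pool-A": ["cDNA-1", "cDNA-2"], "pool-B": ["cDNA-7", "cDNA-8"]}
    print(deconvolute(example_pools, simulated_assay))
```

Screening pools first reduces the number of transfections when most candidate cDNAs do not encode a binding receptor.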

In another embodiment of this assay, an epitope-tagged potential ligand PRO polypeptide (e.g. 8
histidine "His" tag) is allowed to interact with a panel of potential receptor PRO polypeptide molecules that
have been expressed as fusions with the Fc domain of human IgG (immunoadhesins). Following a 1 hour
co-incubation with the epitope tagged PRO polypeptide, the candidate receptors are each immunoprecipitated
with protein A beads and the beads are washed. Potential ligand interaction is determined by western blot
analysis of the immunoprecipitated complexes with antibody directed towards the epitope tag. An interaction
is judged to occur if a band of the anticipated molecular weight of the epitope tagged protein is observed in the
western blot analysis with a candidate receptor, but is not observed to occur with the other members of the
panel of potential receptors.

Using these assays, the following receptor/ligand interactions have been herein identified:

(1) PRO943 binds to FHF1, PRO183 (FHF2), PRO184 (FHF3) and PRO185 (FHF4) and vice versa.

(2) PRO331 binds to PRO1133 and vice versa.

(3) PRO363 binds to PRO1387 and vice versa.

(4) PRO5723 binds to PRO1387 and vice versa.

(5) PRO1114 binds to PRO3301 and PRO9940 and vice versa.

(6) PRO9828 appears to be a novel fibroblast growth factor receptor (FGFR) ligand in that it binds to the
known FGF receptors FGFR1, FGFR2IIIC, FGFR3IIIC and FGFR4. PRO9828 and agonists,
therefore, will find use for activating the biological activities normally activated by FGF molecules
including, for example, cell growth and proliferation. Antagonists of PRO9828 will find use in
blocking the biological activities mediated through the FGF receptor.

(7) PRO1181 binds to PRO7170, PRO361 and PRO846.



EXAMPLE 172: Tissue Expression Distribution

Oligonucleotide probes were constructed from the PRO polypeptide-encoding nucleotide sequences
shown in the figures for use in quantitative PCR amplification reactions. The oligonucleotide probes were
chosen so as to give an approximately 200-600 base pair amplified fragment from the 3' end of its associated
template in a standard PCR reaction. The oligonucleotide probes were employed in standard quantitative PCR
amplification reactions with cDNA libraries isolated from different human adult and/or fetal tissue sources and
analyzed by agarose gel electrophoresis so as to obtain a quantitative determination of the level of expression
of the PRO polypeptide-encoding nucleic acids in the various tissues tested. Knowledge of the expression
pattern or the differential expression of the PRO polypeptide-encoding nucleic acids in various different human
tissue types provides a diagnostic marker useful for tissue typing, with or without other tissue-specific markers,
for determining the primary tissue source of a metastatic tumor, disease diagnosis, and the like. These assays
provided the following results.

DNA Molecule    Tissues w/ Significant Expression                               Tissues w/o Significant Expression
DNA16422-1209   substantia nigra, dendrocytes, uterus                           hippocampus
DNA16435-1208   substantia nigra, dendrocytes, uterus                           hippocampus
DNA26843-1389   dendrocytes, heart, uterus, colon tumor                         hippocampus, substantia nigra,
DNA26844-1394   HUVEC, dendrocytes, cartilage                                   substantia nigra, hippocampus, uterus, prostate
DNA40621-1440   prostate, uterus, colon tumor                                   brain, heart, HUVEC, cartilage
DNA44161-1434   colon tumor, dendrocytes                                        substantia nigra, hippocampus, prostate, uterus
DNA44694-1500   dendrocytes, hippocampus, prostate                              colon tumor, substantia nigra, heart
DNA48320-1433   prostate, uterus                                                colon tumor, brain, heart, cartilage
DNA49647-1398   brain, heart, prostate, uterus                                  cartilage
DNA53913-1490   hippocampus                                                     substantia nigra, dendrocytes
DNA53978-1443   dendrocytes, uterus, prostate                                   substantia nigra, colon tumor
DNA53996-1442   spleen, prostate, uterus, hippocampus                           substantia nigra, heart
DNA56050-1455   prostate, uterus, cartilage, hippocampus                        heart, colon tumor, dendrocytes
DNA56110-1437   spleen, colon tumor, brain, prostate                            heart
DNA56410-1414   uterus, dendrocytes                                             hippocampus, substantia nigra, heart
DNA56436-1448   substantia nigra, prostate, hippocampus                         dendrocytes, heart, HUVEC
DNA56855-1447   prostate, uterus                                                brain, cartilage, heart, colon tumor
DNA56860-1510   colon tumor                                                     prostate, uterus, dendrocytes
DNA56868-1478   colon tumor, prostate                                           uterus, brain, heart, cartilage
DNA56869-1545   prostate, uterus, cartilage                                     brain, colon tumor, spleen, heart
DNA57699-1412   dendrocytes, hippocampus, prostate                              substantia nigra, heart
DNA57704-1452   brain, heart, spleen, uterus, prostate                          colon tumor
DNA57710-1451   dendrocytes, hippocampus, spleen, uterus                        substantia nigra, heart
DNA57711-1501   dendrocytes, hippocampus, heart, cartilage                      substantia nigra
DNA57827-1493   colon tumor, hippocampus, prostate                              substantia nigra, dendrocytes, uterus
DNA58723-1588   substantia nigra, cartilage, uterus                             hippocampus, dendrocytes, HUVEC
DNA58743-1609   brain, prostate, uterus                                         colon tumor, heart, spleen, cartilage
DNA58846-1409   hippocampus, dendrocytes                                        substantia nigra, uterus, prostate, colon tumor
DNA58849-1494   prostate                                                        brain, uterus, cartilage, heart, colon tumor
DNA58850-1495   spleen, prostate, dendrocytes                                   hippocampus, substantia nigra, colon tumor
DNA59213-1487   spleen, cartilage, prostate, substantia nigra                   heart, hippocampus, dendrocytes
DNA59497-1496   dendrocytes, prostate, uterus, heart                            cartilage, hippocampus, substantia nigra
DNA59605-1418   dendrocytes, prostate, uterus                                   hippocampus, substantia nigra, colon tumor
DNA59609-1470   dendrocytes                                                     substantia nigra, hippocampus, heart, prostate, uterus, spleen
DNA59612-1466   prostate, dendrocytes                                           hippocampus, substantia nigra, uterus, colon tumor
DNA59616-1465   dendrocytes, substantia nigra, colon tumor                      hippocampus
DNA59619-1464   dendrocytes, substantia nigra, colon tumor                      hippocampus
DNA59625-1498   brain, colon tumor, prostate, uterus                            THP-1 macrophages
DNA59827-1426   substantia nigra, prostate, uterus                              hippocampus, dendrocytes, heart
DNA59828-1608   dendrocytes, substantia nigra, colon tumor                      hippocampus
DNA59853-1505   prostate                                                        brain, uterus, spleen, heart, colon tumor
DNA59854-1459   cartilage                                                       prostate, brain, heart, colon tumor
DNA60283-1484   dendrocytes, spleen, prostate, uterus                           hippocampus, substantia nigra, heart
DNA60619-1482   dendrocytes, substantia nigra, colon tumor                      hippocampus
DNA60625-1507   cartilage                                                       prostate, brain, heart, colon tumor
DNA60629-1481   uterus, colon tumor, substantia nigra                           hippocampus, dendrocytes, spleen, prostate
DNA61755-1554   dendrocytes, substantia nigra, colon tumor                      hippocampus
DNA64852-1589   prostate, uterus                                                brain, heart, cartilage, colon tumor
DNA66308-1537   prostate, heart, uterus                                         brain, colon tumor, cartilage
DNA68869-1610   spleen, prostate, heart, uterus, colon tumor, substantia nigra  hippocampus, dendrocytes, prostate



EXAMPLE 173: Isolation of cDNA Clones Encoding Human PRO846

A consensus sequence was obtained relative to a variety of EST sequences as described in Example
1 above, wherein the consensus sequence obtained is herein designated DNA39949. Based on the DNA39949
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO846.

Forward and reverse PCR primers were synthesized:
forward PCR primer 5'-CCCTGCAGTGCACCTACAGGGAAG-3' (SEQ ID NO:518)
reverse PCR primer 5'-CTGTCTTGCCCTGCTTGGCTGTGG-3' (SEQ ID NO:519)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39949
sequence which had the following nucleotide sequence
hybridization probe
5'-GGTGGAGGAAGGGTGGGATGCTCTTCTCTGGGTGGTGTGGGCACATC-3' (SEQ ID NO:520)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with one of the PCR primer pairs identified above. A positive library was then
used to isolate clones encoding the PRO846 gene using the probe oligonucleotide and one of the PCR primers.
RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227).

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO846 [herein designated as DNA44196-1353] (SEQ ID NO:516) and the derived protein sequence for
PRO846.

The entire nucleotide sequence of UNQ422 (DNA44196-1353) is shown in Figure 329 (SEQ ID
NO:516). Clone UNQ422 (DNA44196-1353) contains a single open reading frame with an apparent
translational initiation site at nucleotide positions 25-27 and ending at the stop codon at nucleotide positions
1021-1023 (Figure 329). The predicted polypeptide precursor is 332 amino acids long (Figure 330). The
full-length PRO846 protein shown in Figure 330 has an estimated molecular weight of about 36,143 daltons
and a pI of about 5.89. Important regions of the amino acid sequence of PRO846 include the signal peptide,
the transmembrane domain, an N-glycosylation site, a sequence typical of fibrinogen beta and gamma chains
C-terminal domain, and a sequence typical of Ig like V-type domain as shown in Figure 330. Clone UNQ422
(DNA44196-1353) has been deposited with ATCC and is assigned ATCC deposit no. 209847.

EXAMPLE 174: Isolation of cDNA Clones Encoding Human PRO363

A consensus sequence was obtained relative to a variety of EST sequences as described in Example
1 above, wherein the consensus sequence obtained is herein designated DNA42828. Based on the DNA42828
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO363.

A pair of PCR primers (forward and reverse) were synthesized:
forward PCR primer (42828.f1) 5'-CGAGTGGACAGCAGGCAAGGAAGC-3' (SEQ ID NO:521)
reverse PCR primer (42828.r1) 5'-AGTAGGGTGTATGCGTGGGTGGGG-3' (SEQ ID NO:522)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA42828
sequence which had the following nucleotide sequence
hybridization probe (42828.p1)
5'-GTATGTACAAAGCATGGGGATGGTTGGAGGAGCAGTGAGAGGC-3' (SEQ ID NO:523)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used
to isolate clones encoding the PRO363 gene using the probe oligonucleotide and one of the PCR primers.
RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227).

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO363 [herein designated as UNQ318 (DNA45419-1252)] (SEQ ID NO:500) and the derived protein
sequence for PRO363.

The entire nucleotide sequence of UNQ318 (DNA45419-1252) is shown in Figure 313 (SEQ ID
NO:500). Clone UNQ318 (DNA45419-1252) contains a single open reading frame with an apparent
translational initiation site at nucleotide positions 190-192 and ending at the stop codon at nucleotide positions
1309-1311 (Figure 313). The predicted polypeptide precursor is 373 amino acids long (Figure 314). The full-length
PRO363 protein shown in Figure 314 has an estimated molecular weight of about 41,281 daltons and
a pI of about 8.33. A transmembrane domain exists at amino acids 221 to 254 of the amino acid sequence
shown in Figure 314 (SEQ ID NO:501). The PRO363 polypeptide also possesses at least two myelin P0
protein domains from about amino acids 15 to 56 and from about amino acids 87 to 116. Clone UNQ318
(DNA45419-1252) has been deposited with ATCC on February 5, 1998 and is assigned ATCC deposit no.
209616.

Analysis of the amino acid sequence of the full-length PRO363 polypeptide suggests that it possesses
significant sequence similarity to the cell surface protein HCAR, thereby indicating that PRO363 may be a
novel HCAR homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35)
evidenced significant homology between the PRO363 amino acid sequence and the following Dayhoff
sequences: HS46KDA_1, HSU90716_1, MMCARH_1, MMCARHOM_1, MMU90715_1, A33_HUMAN,
P_W14146, P_W14158, A42632 and B42632.

EXAMPLE 175: Isolation of cDNA Clones Encoding a Human PRO9828

A consensus DNA sequence was assembled relative to other nucleic sequences using phrap as
described in Example 1 above. This consensus sequence is herein designated DNA139814. Based on the
DNA139814 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library
that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding
sequence for PRO9828. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are
often designed to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55
bp in length. In some cases, additional oligonucleotides are synthesized when the consensus sequence is
greater than about 1-1.5 kbp. In order to screen several libraries for a full-length clone, DNA from the libraries
was screened by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, supra, with
the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest using
the probe oligonucleotide and one of the primer pairs.
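
The general size guidelines stated above (primers of 20 to 30 nucleotides, probes of 40-55 bp) can be checked with a short helper. This is a sketch only; the function names are hypothetical, the ranges are described in the text as typical rather than mandatory, and the reverse-complement helper is included merely as a convenience for relating a reverse primer to the template strand.

```python
# Typical size ranges quoted in the text (general guidelines, not strict limits).
SIZE_RANGES = {"primer": (20, 30), "probe": (40, 55)}

# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def oligo_length_ok(seq, kind):
    """Return True if the oligonucleotide length falls in the typical range for its kind."""
    low, high = SIZE_RANGES[kind]
    return low <= len(seq) <= high


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    example_primer = "ACGT" * 6  # hypothetical 24-mer
    print(oligo_length_ok(example_primer, "primer"))
    print(reverse_complement("ATGC"))
```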

PCR primers (forward and reverse) were synthesized:
5'-AATGTCAGCACCAGGCACTCAGAGCA-3' (SEQ ID NO:524)
5'-GTTAAAGAGGGTGCCGTTCCAGCGA-3' (SEQ ID NO:525)
5'-TATCGCAATGCCTCCGGACTGCTG-3' (SEQ ID NO:526)
5'-GATGAACTTGC}CGAAGGGGCGGCA-3' (SEQ ID NO:527) 

RNA for construction of the cDNA libraries was isolated from human fetal liver tissue. The cDNA
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available
reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a
NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately by gel
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD;
pRK5B is a precursor of pRK5D that does not contain the SfiI site; see, Holmes et al., Science, 253:1278-1280
(1991)) in the unique XhoI and NotI sites.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for a
full-length PRO9828 polypeptide (designated herein as DNA142238-2768 [Figure 323, SEQ ID NO:510]) and
the derived protein sequence for that PRO9828 polypeptide.

The full length clone identified above contained a single open reading frame with an apparent
translational initiation site at nucleotide positions 232-234 and a stop signal at nucleotide positions 985-987
(Figure 323, SEQ ID NO:510). The predicted polypeptide precursor is 251 amino acids long, has a calculated
molecular weight of approximately 27,954 daltons and an estimated pI of approximately 9.22. Analysis of the
full-length PRO9828 sequence shown in Figure 324 (SEQ ID NO:511) evidences the presence of a variety of
important polypeptide domains as shown in Figure 324, wherein the locations given for those important
polypeptide domains are approximate as described above. Chromosome mapping evidences that the PRO9828-encoding
nucleic acid maps to chromosome 12p13 in humans. Clone DNA142238-2768 has been deposited
with ATCC on October 5, 1999 and is assigned ATCC deposit no. 819-PTA.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using the ALIGN-2 sequence
alignment analysis of the full-length sequence shown in Figure 324 (SEQ ID NO:511), evidenced sequence
identity between the PRO9828 amino acid sequence and the following Dayhoff sequences: P_Y08581,
AB018122_1, FGF3_HUMAN, P_R70824, S54407, P_R80780, P_Y23761, P_W92312, OMFGF6_1 and
P_R80871.

EXAMPLE 176: Isolation of cDNA Clones Encoding a Human PRO7170

DNA108722-2743 was identified by applying a proprietary signal sequence finding algorithm
developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as clustered and assembled EST
fragments from public (e.g., Genbank) and/or private (LIFESEQ®, Incyte Pharmaceuticals, Inc., Palo Alto,
CA) databases. The signal sequence algorithm computes a secretion signal score based on the character of the
DNA nucleotides surrounding the first and optionally the second methionine codon(s) (ATG) at the 5'-end of
the sequence or sequence fragment under consideration. The nucleotides following the first ATG must code
for at least 35 unambiguous amino acids without any stop codons. If the first ATG has the required amino
acids, the second is not examined. If neither meets the requirement, the candidate sequence is not scored. In
order to determine whether the EST sequence contains an authentic signal sequence, the DNA and
corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven sensors
(evaluation parameters) known to be associated with secretion signals.
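
Only the eligibility rule stated above (which ATG, if any, is scored) is described in enough detail to illustrate; the proprietary scoring with seven sensors is not reproduced here. The following is a minimal sketch under stated assumptions: "unambiguous" is taken to mean codons made up solely of A, C, G and T; the 35 codons are counted immediately after the ATG; and the "second ATG" is taken to be the next occurrence of ATG in any frame. All names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_required_codons(seq, atg_pos, min_codons=35):
    """Check that min_codons unambiguous, non-stop codons follow the ATG at atg_pos."""
    start = atg_pos + 3
    for i in range(min_codons):
        codon = seq[start + 3 * i: start + 3 * i + 3]
        if len(codon) < 3:
            return False  # sequence ends before enough codons are read
        if codon in STOP_CODONS:
            return False
        if any(base not in "ACGT" for base in codon):
            return False  # ambiguous base (e.g. N)
    return True


def select_candidate_atg(seq, min_codons=35):
    """Return the position of the ATG to score, or None if the sequence is not scored.

    The second ATG is examined only when the first one fails the requirement.
    """
    seq = seq.upper()
    first = seq.find("ATG")
    if first == -1:
        return None
    if has_required_codons(seq, first, min_codons):
        return first
    second = seq.find("ATG", first + 1)
    if second != -1 and has_required_codons(seq, second, min_codons):
        return second
    return None


if __name__ == "__main__":
    # Hypothetical fragment: first ATG is followed by a stop codon, second one qualifies.
    fragment = "ATG" + "TAA" + "C" + "ATG" + "GCT" * 35
    print(select_candidate_atg(fragment))
```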

Use of the above described signal sequence algorithm allowed identification of an EST cluster
sequence from the LIFESEQ® database, Incyte Pharmaceuticals, Palo Alto, designated herein as CLU57836.
This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which
included public EST databases (e.g., Genbank) and a proprietary EST DNA database (LIFESEQ®, Incyte
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using
the computer program BLAST or BLAST2 (Altschul et al., Methods in Enzymology 266:460-480 (1996)).
Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode
known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil
Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein
designated DNA58756.

In light of an observed sequence homology between the DNA58756 sequence and an EST sequence
encompassed within clone no. 2251462 from the LIFESEQ® database, Incyte Pharmaceuticals, Palo Alto, CA,
clone no. 2251462 was purchased and the cDNA insert was obtained and sequenced. It was found herein that
that cDNA insert encoded a full-length protein. The sequence of this cDNA insert is shown in Figure 325 and
is herein designated as DNA108722-2743.

Clone DNA108722-2743 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 60-62 and ending at the stop codon at nucleotide positions 1506-1508 (Figure 325).
The predicted polypeptide precursor is 482 amino acids long (Figure 326). The full-length PRO7170 protein
shown in Figure 326 has an estimated molecular weight of about 49,060 daltons and a pI of about 4.74.
Analysis of the full-length PRO7170 sequence shown in Figure 326 (SEQ ID NO:513) evidences the presence
of a variety of important polypeptide domains as shown in Figure 326, wherein the locations given for those
important polypeptide domains are approximate as described above. Clone DNA108722-2743 has been
deposited with ATCC on August 17, 1999 and is assigned ATCC Deposit No. 552-PTA.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using the ALIGN-2 sequence
alignment analysis of the full-length sequence shown in Figure 326 (SEQ ID NO:513), evidenced sequence
identity between the PRO7170 amino acid sequence and the following Dayhoff sequences: P_Y12291, I47141,
D88733_1, DMC56G7_1, P_Y11606, HWP1_CANAL, HSMUC5BEX_1, HSU78550_1, HSU70136_1, and
SGS3_DROME.

EXAMPLE 177: Isolation of cDNA Clones Encoding Human PRO361

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described
in Example 1 above. This consensus sequence is herein designated DNA40654. Based on the DNA40654
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO361.

Forward and reverse PCR primers were synthesized as follows:
forward PCR primer 5'-AGGGAGGATTATGCTTGAGCTTTGAAGACG-3' (SEQ ID NO:528)
forward PCR primer 5'-GAAGCAAGTGCGCAGGTC-3' (SEQ ID NO:529)
forward PCR primer 5'-CGGGTGGGTGGTGTTTGG-3' (SEQ ID NO:530)
reverse PCR primer 5'-GAGCGTAGGTGGGAGGGGAGTCAG-3' (SEQ ID NO:531)
reverse PCR primer 5'-AGTGTAAGTGAAGCTGGG-3' (SEQ ID NO:532)

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus
DNA40654 sequence which had the following nucleotide sequence
hybridization probe
5'-GGTTCCTGAGAGTAAGGGTGTCTGGTAGTGAGAATTGCGTGAAAAAGAG-3' (SEQ ID NO:533)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with one of the PCR primer pairs identified above. A positive library was then
used to isolate clones encoding the PRO361 gene using the probe oligonucleotide. RNA for construction of
the cDNA libraries was isolated from human fetal kidney tissue.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO361 [herein designated as DNA45410-1250] (SEQ ID NO:514) and the derived protein sequence for
PRO361.

The entire nucleotide sequence of DNA45410-1250 is shown in Figure 327 (SEQ ID NO:514). Clone
DNA45410-1250 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 226-228 and ending at the stop codon at nucleotide positions 1519-1521 (Figure 327). The predicted
polypeptide precursor is 431 amino acids long (Figure 328). The full-length PRO361 protein shown in Figure
328 has an estimated molecular weight of about 46,810 daltons and a pI of about 6.45. In addition, regions
of interest including the transmembrane domain (amino acids 380-409) and sequences typical of the arginase
family of proteins (amino acids 3-14 and 39-57) are designated in Figure 328. Clone DNA45410-1250 has
been deposited with ATCC and is assigned ATCC deposit no. ATCC 209621.
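
The open reading frame coordinates reported in Examples 173 to 177 can be cross-checked arithmetically: the number of encoded amino acids equals the distance from the first nucleotide of the initiation codon to the first nucleotide of the stop codon, divided by three. The sketch below applies that check to the positions and lengths quoted in the text; the function and dictionary names are hypothetical.

```python
def orf_length_aa(start_first_nt, stop_first_nt):
    """Amino acids encoded between the initiation codon and the stop codon.

    Both arguments are the 1-based position of the first nucleotide of the codon.
    The initiation codon is counted; the stop codon is not.
    """
    coding_nt = stop_first_nt - start_first_nt
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return coding_nt // 3


# (start position, stop position, reported length) as quoted in the examples.
REPORTED = {
    "PRO846": (25, 1021, 332),
    "PRO363": (190, 1309, 373),
    "PRO9828": (232, 985, 251),
    "PRO7170": (60, 1506, 482),
    "PRO361": (226, 1519, 431),
}

if __name__ == "__main__":
    for name, (start, stop, reported) in REPORTED.items():
        print(name, orf_length_aa(start, stop), reported)
```

For each of the five clones, the computed length agrees with the polypeptide length stated in the corresponding example.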

Analysis of the amino acid sequence of the full-length PRO361 polypeptide suggests that portions of
it possess significant homology to the mucin and/or chitinase proteins, thereby indicating that PRO361 may
be a novel mucin and/or chitinase protein.

EXAMPLE 178: Isolation of cDNA Clones Encoding a Human PRO183, PRO184, PRO185, PRO5723,
PRO3301 or PRO9940

DNA molecules encoding the PRO183, PRO184, PRO185, PRO5723, PRO3301 or PRO9940
polypeptides shown in the accompanying figures were obtained through GenBank.

Deposit of Material

The following materials have been deposited with the American Type Culture Collection, 10801
University Blvd., Manassas, VA 20110-2209, USA (ATCC):



Table 10 





Material          ATCC Dep. No.    Deposit Date
DNA45410-1250     209621           February 5, 1998
DNA108722-2743    552-PTA          August 17, 1999
DNA142238-2768    819-PTA          October 5, 1999




UNA4uyol-lZ34 








JJIN A* J** I y- 1 Zj z 




i^CDruary j, i>yo 








M^»v f\ 190S 




HN A 1 ^%499- 1 9ft0 


209929 


June 2 1998 




DNA16435-1208     209930           June 2, 1998
DNA21624-1391     209917           June 2, 1998
DNA23334-1392     209918           June 2, 1998
DNA26288-1239     209792           April 21, 1998
DNA26843-1389     203099           August 4, 1998
DNA26844-1394     209926           June 2, 1998
DNA30862-1396     209920           June 2, 1998
DNA35680-1212     209790           April 21, 1998
DNA40621-1440     209922           June 2, 1998
DNA44161-1434     209907           May 27, 1998
DNA44694-1500     203114           August 11, 1998
DNA45495-1550     203156           August 25, 1998
DNA47361-1154     209431           November 7, 1997
DNA47394-1572     203109           August 11, 1998
DNA48320-1433     209904           May 27, 1998
DNA48334-1435     209924           June 2, 1998
DNA48606-1479     203040           July 1, 1998
DNA49141-1431     203003           June 23, 1998
DNA49142-1430     203002           June 23, 1998
DNA49143-1429     203013           June 23, 1998
DNA49647-1398     209919           June 2, 1998
DNA49819-1439     209931           June 2, 1998
DNA49820-1427     209932           June 2, 1998
DNA49821-1562     209981           June 16, 1998
DNA52192-1369     203042           July 1, 1998
DNA52598-1518     203107           August 11, 1998
DNA53913-1490     203162           August 25, 1998
Table 10 (cont'd)

DNA53978-1443         209983           June 16, 1998
DNA53996-1442         209921           June 2, 1998
DNA56041-1416         203012           June 23, 1998
DNA56047-1456         209948           June 9, 1998
DNA56050-1455         203011           June 23, 1998
DNA56110-1437         203113           August 11, 1998
DNA56113-1378         203049           July 1, 1998
DNA56410-1414         209923           June 2, 1998
DNA56436-1448         209902           May 27, 1998
DNA56855-1447         203004           June 23, 1998
DNA56859-1445         203019           June 23, 1998
DNA56860-1510         209952           June 9, 1998
DNA56865-1491         203022           June 23, 1998
DNA56866-1342         203023           June 23, 1998
DNA56868-1209         203024           June 23, 1998
DNA56869-1545         203161           August 25, 1998
DNA56870-1492         209925           June 2, 1998
DNA57033-1403         209905           May 27, 1998
DNA57037-1444         209903           May 27, 1998
DNA57129-1413         209977           June 16, 1998
DNA57690-1374         209950           June 9, 1998
DNA57693-1424         203008           June 23, 1998
DNA57694-1341         203017           June 23, 1998
DNA57695-1340         203006           June 23, 1998
DNA57699-1412         203020           June 23, 1998
DNA57702-1476         209951           June 9, 1998
DNA57704-1452         209953           June 9, 1998
DNA57708-1411         203021           June 23, 1998
DNA57710-1451         203048           July 1, 1998
DNA57711-1501         203047           July 1, 1998
DNA57827-1493         203045           July 1, 1998
DNA57834-1339         209954           June 9, 1998
DNA57836-1338         203025           June 23, 1998
DNA57838-1337         203014           June 23, 1998
DNA57844-1410         203010           June 23, 1998
DNA58721-1475         203110           August 11, 1998
DNA58723-1588         203133           August 18, 1998
DNA58737-1473         203136           August 18, 1998
DNA58743-1609         203154           August 25, 1998
DNA58846-1409         209957           June 9, 1998
DNA58848-1472         209955           June 9, 1998
DNA58849-1494         209958           June 9, 1998
DNA58850-1495         209956           June 9, 1998
DNA58853-1423         203016           June 23, 1998
DNA58855-1422         203018           June 23, 1998
DNA59205-1421         203009           June 23, 1998
DNA59211-1450         209960           June 9, 1998
DNA59213-1487         209959           June 9, 1998
DNA59214-1449         203046           July 1, 1998
DNA59215-1425         209961           June 9, 1998
DNA59220-1514         209962           June 9, 1998
DNA59488-1603         203157           August 25, 1998
DNA59493-1420         203050           July 1, 1998
DNA59497-1496         209941           June 4, 1998
DNA59588-1571         203106           August 11, 1998
Table 10 (cont'd)

DNA59603-1419         209944           June 9, 1998
DNA59605-1418         203005           June 23, 1998
DNA59606-1471         209945           June 9, 1998
DNA59607-1497         209957           June 9, 1998
DNA59609-1470         209963           June 9, 1998
DNA59610-1559         209990           June 16, 1998
DNA59612-1466         209947           June 9, 1998
DNA59613-1417         203007           June 23, 1998
DNA59616-1465         209991           June 16, 1998
DNA59619-1464         203041           July 1, 1998
DNA59620-1463         209989           June 16, 1998
DNA59625-1498         209992           June 17, 1998
DNA59767-1489         203108           August 11, 1998
DNA59776-1600         203128           August 18, 1998
DNA59777-1480         203111           August 11, 1998
DNA59820-1549         203129           August 18, 1998
DNA59827-1426         203089           August 4, 1998
DNA59828-1608         203158           August 25, 1998
DNA59838-1462         209976           June 16, 1998
DNA59839-1461         209988           June 16, 1998
DNA59841-1460         203044           July 1, 1998
DNA59842-1502         209982           June 16, 1998
DNA59846-1503         209978           June 16, 1998
DNA59847-1511         203098           August 4, 1998
DNA59848-1512         203088           August 4, 1998
DNA59849-1504         209986           June 16, 1998
DNA59853-1505         209985           June 16, 1998
DNA59854-1459         209974           June 16, 1998
DNA60283-1484         203043           July 1, 1998
DNA60615-1483         209980           June 16, 1998
DNA60619-1482         209993           June 16, 1998
DNA60621-1516         203091           August 4, 1998
DNA60622-1525         203090           August 4, 1998
DNA60625-1507         209975           June 16, 1998
DNA60627-1508         203092           August 4, 1998
DNA60629-1481         209979           June 16, 1998
DNA61755-1554         203112           August 11, 1998
DNA61873-1574         203132           August 18, 1998
DNA62814-1521         203093           August 4, 1998
DNA62872-1509         203100           August 4, 1998
DNA62876-1517         203095           August 4, 1998
DNA62881-1515         203096           August 4, 1998
DNA64852-1589         203127           August 18, 1998
DNA64884-1527         203155           August 25, 1998
DNA64890-1612         203131           August 18, 1998
DNA65412-1523         203094           August 4, 1998
DNA66308-1537         203159           August 25, 1998
DNA66309-1538         203235           September 15, 1998
DNA67004-1614         203115           August 11, 1998
DNA68869-1610         203164           August 25, 1998
DNA68872-1620         203160           August 25, 1998
DNA71159-1617         203135           August 18, 1998
These deposits were made under the provisions of the Budapest Treaty on the International Recognition
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder
(Budapest Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of
deposit. The deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject
to an agreement between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability
of the progeny of the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon
laying open to the public of any U.S. or foreign patent application, whichever comes first, and assures
availability of the progeny to one determined by the U.S. Commissioner of Patents and Trademarks to be
entitled thereto according to 35 USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR
§1.14 with particular reference to 886 OG 638).

The assignee of the present application has agreed that if a culture of the materials on deposit should
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced
on notification with another of the same. Availability of the deposited material is not to be construed as a
license to practice the invention in contravention of the rights granted under the authority of any government
in accordance with its patent laws.

The foregoing written specification is considered to be sufficient to enable one skilled in the art to
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the
deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not
constitute an admission that the written description herein contained is inadequate to enable the practice of any
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition
to those shown and described herein will become apparent to those skilled in the art from the foregoing
description and fall within the scope of the appended claims.
WHAT IS CLAIMED IS:

1. Isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that
encodes a polypeptide comprising an amino acid sequence selected from the group consisting of the amino acid
sequence shown in
Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14),
Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30),
Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47),
Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67),
Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97),
Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID NO:113), Figure 66 (SEQ ID NO:115),
Figure 68 (SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), Figure 74 (SEQ ID NO:129),
Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID NO:138), Figure 83 (SEQ ID NO:146), Figure 85 (SEQ ID NO:148),
Figure 88 (SEQ ID NO:151), Figure 90 (SEQ ID NO:153), Figure 93 (SEQ ID NO:156), Figure 95 (SEQ ID NO:158),
Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169),
Figure 105 (SEQ ID NO:171), Figure 109 (SEQ ID NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID NO:179),
Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194),
Figure 125 (SEQ ID NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203),
Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:211),
Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221),
Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229),
Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246),
Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256),
Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264),
Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272),
Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281),
Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289),
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297),
Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309),
Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334),
Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349),
Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357),
Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365),
Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373),
Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381),
Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399),
Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410),
Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418),
Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422), Figure 306 (SEQ ID NO:424), Figure 308 (SEQ ID NO:495),
Figure 310 (SEQ ID NO:497), Figure 312 (SEQ ID NO:499), Figure 314 (SEQ ID NO:501), Figure 316 (SEQ ID NO:503),
Figure 318 (SEQ ID NO:505), Figure 320 (SEQ ID NO:507), Figure 322 (SEQ ID NO:509), Figure 324 (SEQ ID NO:511),
Figure 326 (SEQ ID NO:513), Figure 328 (SEQ ID NO:515) and Figure 330 (SEQ ID NO:517).

2. The nucleic acid sequence of Claim 1, wherein said nucleotide sequence comprises a
nucleotide sequence selected from the group consisting of the sequence shown in
Figure 1 (SEQ ID NO:1), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13),
Figure 11 (SEQ ID NO:19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29),
Figure 22 (SEQ ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46),
Figure 31 (SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66),
Figure 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96),
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO:102), Figure 63 (SEQ ID NO:112), Figure 65 (SEQ ID NO:114),
Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:123), Figure 73 (SEQ ID NO:128),
Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID NO:145), Figure 84 (SEQ ID NO:147),
Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID NO:155), Figure 94 (SEQ ID NO:157),
Figure 96 (SEQ ID NO:159), Figure 98 (SEQ ID NO:164), Figure 100 (SEQ ID NO:166), Figure 102 (SEQ ID NO:168),
Figure 104 (SEQ ID NO:170), Figure 108 (SEQ ID NO:174), Figure 110 (SEQ ID NO:176), Figure 112 (SEQ ID NO:178),
Figure 114 (SEQ ID NO:180), Figure 116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193),
Figure 124 (SEQ ID NO:196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202),
Figure 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210),
Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220),
Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ ID NO:228),
Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245),
Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255),
Figure 175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263),
Figure 183 (SEQ ID NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271),
Figure 191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280),
Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288),
Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ ID NO:296),
Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308),
Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333),
Figure 237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348),
Figure 245 (SEQ ID NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356),
Figure 253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364),
Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372),
Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ ID NO:380),
Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398),
Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409),
Figure 293 (SEQ ID NO:411), Figure 295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417),
Figure 301 (SEQ ID NO:419), Figure 303 (SEQ ID NO:421), Figure 305 (SEQ ID NO:423), Figure 307 (SEQ ID NO:494),
Figure 309 (SEQ ID NO:496), Figure 311 (SEQ ID NO:498), Figure 313 (SEQ ID NO:500), Figure 315 (SEQ ID NO:502),
Figure 317 (SEQ ID NO:504), Figure 319 (SEQ ID NO:506), Figure 321 (SEQ ID NO:508), Figure 323 (SEQ ID NO:510),
Figure 325 (SEQ ID NO:512), Figure 327 (SEQ ID NO:514) and Figure 329 (SEQ ID NO:516).

3. The nucleic acid of Claim 1, wherein said nucleotide sequence comprises a nucleotide
sequence selected from the group consisting of the full-length coding sequence of the sequence shown in
Figure 1 (SEQ ID NO:1), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13),
Figure 11 (SEQ ID NO:19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29),
Figure 22 (SEQ ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46),
Figure 31 (SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66),
Figure 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96),
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO:102), Figure 63 (SEQ ID NO:112), Figure 65 (SEQ ID NO:114),
Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:123), Figure 73 (SEQ ID NO:128),
Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID NO:145), Figure 84 (SEQ ID NO:147),
Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID NO:155), Figure 94 (SEQ ID NO:157),
Figure 96 (SEQ ID NO:159), Figure 98 (SEQ ID NO:164), Figure 100 (SEQ ID NO:166), Figure 102 (SEQ ID NO:168),
Figure 104 (SEQ ID NO:170), Figure 108 (SEQ ID NO:174), Figure 110 (SEQ ID NO:176), Figure 112 (SEQ ID NO:178),
Figure 114 (SEQ ID NO:180), Figure 116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193),
Figure 124 (SEQ ID NO:196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202),
Figure 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210),
Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220),
Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ ID NO:228),
Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245),
Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255),
Figure 175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263),
Figure 183 (SEQ ID NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271),
Figure 191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280),
Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288),
Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ ID NO:296),
Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308),
Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333),
Figure 237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348),
Figure 245 (SEQ ID NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356),
Figure 253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364),
Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372),
Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ ID NO:380),
Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398),
Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409),
Figure 293 (SEQ ID NO:411), Figure 295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417),
Figure 301 (SEQ ID NO:419), Figure 303 (SEQ ID NO:421), Figure 305 (SEQ ID NO:423), Figure 307 (SEQ ID NO:494),
Figure 309 (SEQ ID NO:496), Figure 311 (SEQ ID NO:498), Figure 313 (SEQ ID NO:500), Figure 315 (SEQ ID NO:502),
Figure 317 (SEQ ID NO:504), Figure 319 (SEQ ID NO:506), Figure 321 (SEQ ID NO:508), Figure 323 (SEQ ID NO:510),
Figure 325 (SEQ ID NO:512), Figure 327 (SEQ ID NO:514) and Figure 329 (SEQ ID NO:516).

4. Isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited
under any ATCC accession number shown in Table 10.

5. A vector comprising the nucleic acid of Claim 1.

6. The vector of Claim 5 operably linked to control sequences recognized by a host cell 
transformed with the vector. 

7. A host cell comprising the vector of Claim 5.

8. The host cell of Claim 7 wherein said cell is a CHO cell. 

9. The host cell of Claim 7 wherein said cell is an E. coli.

10. The host cell of Claim 7 wherein said cell is a yeast cell. 
11. A process for producing a PRO polypeptide comprising culturing the host cell of Claim 7
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from
the cell culture.



12. Isolated PRO polypeptide having at least 80% sequence identity to an amino acid sequence
selected from the group consisting of the amino acid sequence shown in
Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14),
Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30),
Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47),
Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67),
Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97),
Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID NO:113), Figure 66 (SEQ ID NO:115),
Figure 68 (SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), Figure 74 (SEQ ID NO:129),
Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID NO:138), Figure 83 (SEQ ID NO:146), Figure 85 (SEQ ID NO:148),
Figure 88 (SEQ ID NO:151), Figure 90 (SEQ ID NO:153), Figure 93 (SEQ ID NO:156), Figure 95 (SEQ ID NO:158),
Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169),
Figure 105 (SEQ ID NO:171), Figure 109 (SEQ ID NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID NO:179),
Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194),
Figure 125 (SEQ ID NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203),
Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:211),
Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221),
Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229),
Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246),
Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256),
Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264),
Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272),
Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281),
Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289),
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297),
Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309),
Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334),
Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349),
Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357),
Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365),
Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373),
Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381),
Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399),
Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410),
Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418),
Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422), Figure 306 (SEQ ID NO:424), Figure 308 (SEQ ID NO:495),
Figure 310 (SEQ ID NO:497), Figure 312 (SEQ ID NO:499), Figure 314 (SEQ ID NO:501), Figure 316 (SEQ ID NO:503),
Figure 318 (SEQ ID NO:505), Figure 320 (SEQ ID NO:507), Figure 322 (SEQ ID NO:509), Figure 324 (SEQ ID NO:511),
Figure 326 (SEQ ID NO:513), Figure 328 (SEQ ID NO:515) and Figure 330 (SEQ ID NO:517).

13. Isolated PRO polypeptide having at least 80% sequence identity to the amino acid sequence 
encoded by a nucleic acid molecule deposited under any ATCC accession number shown in Table 10. 

14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous
amino acid sequence.

15. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an 
epitope tag sequence. 

16. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an Fc
region of an immunoglobulin.

17. An antibody which specifically binds to a PRO polypeptide according to Claim 12. 

18. The antibody of Claim 17 wherein said antibody is a monoclonal antibody.

19. The antibody of Claim 17 wherein said antibody is a humanized antibody. 

20. The antibody of Claim 17 wherein said antibody is an antibody fragment. 


21. An isolated nucleic acid molecule which has at least 80% sequence identity to a nucleic acid
which comprises a nucleotide sequence selected from the group consisting of that shown in
Figure 1 (SEQ ID NO:1), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13),
Figure 11 (SEQ ID NO:19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29),
Figure 22 (SEQ ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46),
Figure 31 (SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66),
Figure 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96),
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO:102), Figure 63 (SEQ ID NO:112), Figure 65 (SEQ ID NO:114),
Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:123), Figure 73 (SEQ ID NO:128),
Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID NO:145), Figure 84 (SEQ ID NO:147),
Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID NO:155), Figure 94 (SEQ ID NO:157),
Figure 96 (SEQ ID NO:159), Figure 98 (SEQ ID NO:164), Figure 100 (SEQ ID NO:166), Figure 102 (SEQ ID NO:168),
Figure 104 (SEQ ID NO:170), Figure 108 (SEQ ID NO:174), Figure 110 (SEQ ID NO:176), Figure 112 (SEQ ID NO:178),
Figure 114 (SEQ ID NO:180), Figure 116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193),
Figure 124 (SEQ ID NO:196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202),
Figure 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210),
Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220),
Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ ID NO:228),
Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245),
Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255),
Figure 175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263),
Figure 183 (SEQ ID NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271),
Figure 191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280),
Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288),
Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ ID NO:296),
Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308),
Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333),
Figure 237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348),
Figure 245 (SEQ ID NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356),
Figure 253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364),
Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372),
Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ ID NO:380),
Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398),
Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409),
Figure 293 (SEQ ID NO:411), Figure 295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417),
Figure 301 (SEQ ID NO:419), Figure 303 (SEQ ID NO:421), Figure 305 (SEQ ID NO:423), Figure 307 (SEQ ID NO:494),
Figure 309 (SEQ ID NO:496), Figure 311 (SEQ ID NO:498), Figure 313 (SEQ ID NO:500), Figure 315 (SEQ ID NO:502),
Figure 317 (SEQ ID NO:504), Figure 319 (SEQ ID NO:506), Figure 321 (SEQ ID NO:508), Figure 323 (SEQ ID NO:510),
Figure 325 (SEQ ID NO:512), Figure 327 (SEQ ID NO:514) and Figure 329 (SEQ ID NO:516).
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22. An isolated nucleic acid molecule which has at least 80% sequence identity to the full-length 
coding sequence of a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ 
ID NO:1), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13), Figure 11 (SEQ
ID NO:19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22
(SEQ ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46),
Figure 31 (SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID
NO:66), Figure 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50
(SEQ ID NO:96), Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO:102), Figure 63 (SEQ ID NO:112),
Figure 65 (SEQ ID NO:114), Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:118), Figure 71 (SEQ
ID NO:123), Figure 73 (SEQ ID NO:128), Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO:137), Figure
82 (SEQ ID NO:145), Figure 84 (SEQ ID NO:147), Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID
NO:152), Figure 92 (SEQ ID NO:155), Figure 94 (SEQ ID NO:157), Figure 96 (SEQ ID NO:159), Figure
98 (SEQ ID NO:164), Figure 100 (SEQ ID NO:166), Figure 102 (SEQ ID NO:168), Figure 104 (SEQ ID
NO:170), Figure 108 (SEQ ID NO:174), Figure 110 (SEQ ID NO:176), Figure 112 (SEQ ID NO:178), Figure
114 (SEQ ID NO:180), Figure 116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID
NO:193), Figure 124 (SEQ ID NO:196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure
130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID
NO:208), Figure 138 (SEQ ID NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure
146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID
NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure
160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID
NO:247), Figure 168 (SEQ ID NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure
175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID
NO:263), Figure 183 (SEQ ID NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure
189 (SEQ ID NO:271), Figure 191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID
NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure
203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID
NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure
217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID
NO:313), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure
237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID
NO:348), Figure 245 (SEQ ID NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure
251 (SEQ ID NO:356), Figure 253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID
NO:362), Figure 259 (SEQ ID NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure
265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID
NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure
279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID
NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure
293 (SEQ ID NO:411), Figure 295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID
NO:417), Figure 301 (SEQ ID NO:419), Figure 303 (SEQ ID NO:421), Figure 305 (SEQ ID NO:423), Figure
307 (SEQ ID NO:494), Figure 309 (SEQ ID NO:496), Figure 311 (SEQ ID NO:498), Figure 313 (SEQ ID
NO:500), Figure 315 (SEQ ID NO:502), Figure 317 (SEQ ID NO:504), Figure 319 (SEQ ID NO:506), Figure
321 (SEQ ID NO:508), Figure 323 (SEQ ID NO:510), Figure 325 (SEQ ID NO:512), Figure 327 (SEQ ID
NO:514) and Figure 329 (SEQ ID NO:516).

23. An isolated extracellular domain of a PRO polypeptide.

24. An isolated PRO polypeptide lacking its associated signal peptide.

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an
extracellular domain of a PRO polypeptide.

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO
polypeptide lacking its associated signal peptide.

27. Isolated nucleic acid having at least 80% nucleic acid sequence identity to: 

(a) a nucleotide sequence encoding the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure
4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14), Figure 12 (SEQ ID NO:20), Figure
15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33),
Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID
NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41
(SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97),
Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID NO:113), Figure 66 (SEQ ID
NO:115), Figure 68 (SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), Figure
74 (SEQ ID NO:129), Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID NO:138), Figure 83 (SEQ ID
NO:146), Figure 85 (SEQ ID NO:148), Figure 88 (SEQ ID NO:151), Figure 90 (SEQ ID NO:153), Figure
93 (SEQ ID NO:156), Figure 95 (SEQ ID NO:158), Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID
NO:165), Figure 101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169), Figure 105 (SEQ ID NO:171), Figure
109 (SEQ ID NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID NO:179), Figure 115 (SEQ ID
NO:181), Figure 117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194), Figure
125 (SEQ ID NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID
NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure
139 (SEQ ID NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID
NO:219), Figure 149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure
155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID
NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure
169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID 
NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 
184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID 
NO:272), Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 
198 (SEQ ID NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID 
NO:287), Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure
212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID
NO:301), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure
230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID
NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure
246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID
NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure
260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID
NO:371), Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure
274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID
NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure
288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID
NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), Figure
302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422), Figure 306 (SEQ ID NO:424), Figure 308 (SEQ ID
NO:495), Figure 310 (SEQ ID NO:497), Figure 312 (SEQ ID NO:499), Figure 314 (SEQ ID NO:501), Figure
316 (SEQ ID NO:503), Figure 318 (SEQ ID NO:505), Figure 320 (SEQ ID NO:507), Figure 322 (SEQ ID
NO:509), Figure 324 (SEQ ID NO:511), Figure 326 (SEQ ID NO:513), Figure 328 (SEQ ID NO:515) or
Figure 330 (SEQ ID NO:517), lacking its associated signal peptide;

(b) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 
2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14), Figure
12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30),
Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID
NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38
(SEQ ID NO:67), Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95),
Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID
NO:113), Figure 66 (SEQ ID NO:115), Figure 68 (SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure
72 (SEQ ID NO:124), Figure 74 (SEQ ID NO:129), Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID
NO:138), Figure 83 (SEQ ID NO:146), Figure 85 (SEQ ID NO:148), Figure 88 (SEQ ID NO:151), Figure
90 (SEQ ID NO:153), Figure 93 (SEQ ID NO:156), Figure 95 (SEQ ID NO:158), Figure 97 (SEQ ID
NO:160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169), Figure
105 (SEQ ID NO:171), Figure 109 (SEQ ID NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID
NO:179), Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), Figure
122 (SEQ ID NO:194), Figure 125 (SEQ ID NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID
NO:201), Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure
137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID
NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure
153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID
NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure
167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID
NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure
182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID
NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure
196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID
NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure
210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID
NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), Figure
228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID
NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure
244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID
NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure
258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID
NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure
272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID
NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure
286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID
NO:410), Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure
300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422), Figure 306 (SEQ ID
NO:424), Figure 308 (SEQ ID NO:495), Figure 310 (SEQ ID NO:497), Figure 312 (SEQ ID NO:499), Figure
314 (SEQ ID NO:501), Figure 316 (SEQ ID NO:503), Figure 318 (SEQ ID NO:505), Figure 320 (SEQ ID
NO:507), Figure 322 (SEQ ID NO:509), Figure 324 (SEQ ID NO:511), Figure 326 (SEQ ID NO:513), Figure
328 (SEQ ID NO:515) or Figure 330 (SEQ ID NO:517), with its associated signal peptide; or

(c) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 
2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14), Figure
12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30),
Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID
NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38
(SEQ ID NO:67), Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95),
Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID
NO:113), Figure 66 (SEQ ID NO:115), Figure 68 (SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure
72 (SEQ ID NO:124), Figure 74 (SEQ ID NO:129), Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID
NO:138), Figure 83 (SEQ ID NO:146), Figure 85 (SEQ ID NO:148), Figure 88 (SEQ ID NO:151), Figure
90 (SEQ ID NO:153), Figure 93 (SEQ ID NO:156), Figure 95 (SEQ ID NO:158), Figure 97 (SEQ ID
NO:160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169), Figure
105 (SEQ ID NO:171), Figure 109 (SEQ ID NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID
NO:179), Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), Figure
122 (SEQ ID NO:194), Figure 125 (SEQ ID NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID
NO:201), Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure
137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID
NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure
153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID
NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure
167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID
NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure
182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID
NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure
196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID
NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure
210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID
NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), Figure
228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID
NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure
244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID
NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure
258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID
NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure
272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID
NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure
286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID
NO:410), Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure
300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422), Figure 306 (SEQ ID
NO:424), Figure 308 (SEQ ID NO:495), Figure 310 (SEQ ID NO:497), Figure 312 (SEQ ID NO:499), Figure
314 (SEQ ID NO:501), Figure 316 (SEQ ID NO:503), Figure 318 (SEQ ID NO:505), Figure 320 (SEQ ID
NO:507), Figure 322 (SEQ ID NO:509), Figure 324 (SEQ ID NO:511), Figure 326 (SEQ ID NO:513), Figure
328 (SEQ ID NO:515) or Figure 330 (SEQ ID NO:517), lacking its associated signal peptide.

28. An isolated polypeptide having at least 80% amino acid sequence identity to: 
(a) the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ
ID NO:8), Figure 9 (SEQ ID NO:14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18
(SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36),
Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID
NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), Figure 47
(SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99),
Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID NO:113), Figure 66 (SEQ ID NO:115), Figure 68 (SEQ
ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), Figure 74 (SEQ ID NO:129), Figure
76 (SEQ ID NO:135), Figure 79 (SEQ ID NO:138), Figure 83 (SEQ ID NO:146), Figure 85 (SEQ ID
NO:148), Figure 88 (SEQ ID NO:151), Figure 90 (SEQ ID NO:153), Figure 93 (SEQ ID NO:156), Figure
95 (SEQ ID NO:158), Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID
NO:167), Figure 103 (SEQ ID NO:169), Figure 105 (SEQ ID NO:171), Figure 109 (SEQ ID NO:175), Figure
111 (SEQ ID NO:177), Figure 113 (SEQ ID NO:179), Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID
NO:183), Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194), Figure 125 (SEQ ID NO:197), Figure
127 (SEQ ID NO:199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID
NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:211), Figure
141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID
NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure
157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID
NO:241), Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure
171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID
NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure
186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID
NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), Figure
200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID
NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure
214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID
NO:303), Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure
233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID
NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351), Figure
248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID
NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure
262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID
NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure
276 (SEQ ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID
NO:394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure
290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID
NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure
304 (SEQ ID NO:422), Figure 306 (SEQ ID NO:424), Figure 308 (SEQ ID NO:495), Figure 310 (SEQ ID
NO:497), Figure 312 (SEQ ID NO:499), Figure 314 (SEQ ID NO:501), Figure 316 (SEQ ID NO:503), Figure
318 (SEQ ID NO:505), Figure 320 (SEQ ID NO:507), Figure 322 (SEQ ID NO:509), Figure 324 (SEQ ID
NO:511), Figure 326 (SEQ ID NO:513), Figure 328 (SEQ ID NO:515) or Figure 330 (SEQ ID NO:517),
lacking its associated signal peptide;

(b) an extracellular domain of the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ
ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ
ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25
(SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52),
Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID
NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53
(SEQ ID NO:99), Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID NO:113), Figure 66 (SEQ ID NO:115),
Figure 68 (SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), Figure 74 (SEQ
ID NO:129), Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID NO:138), Figure 83 (SEQ ID NO:146), Figure
85 (SEQ ID NO:148), Figure 88 (SEQ ID NO:151), Figure 90 (SEQ ID NO:153), Figure 93 (SEQ ID
NO:156), Figure 95 (SEQ ID NO:158), Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID NO:165), Figure
101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169), Figure 105 (SEQ ID NO:171), Figure 109 (SEQ ID
NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID NO:179), Figure 115 (SEQ ID NO:181), Figure
117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194), Figure 125 (SEQ ID
NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), Figure
133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID
NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure
149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID
NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure
163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID
NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure
178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID
NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure
192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID
NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure
206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID
NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure
220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID
NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure
240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID
NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure
254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID
NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure
268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID
NO:379), Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure
282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID
NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412), Figure
296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID
NO:420), Figure 304 (SEQ ID NO:422), Figure 306 (SEQ ID NO:424), Figure 308 (SEQ ID NO:495), Figure
310 (SEQ ID NO:497), Figure 312 (SEQ ID NO:499), Figure 314 (SEQ ID NO:501), Figure 316 (SEQ ID
NO:503), Figure 318 (SEQ ID NO:505), Figure 320 (SEQ ID NO:507), Figure 322 (SEQ ID NO:509), Figure
324 (SEQ ID NO:511), Figure 326 (SEQ ID NO:513), Figure 328 (SEQ ID NO:515) or Figure 330 (SEQ ID
NO:517), with its associated signal peptide; or

(c) an extracellular domain of the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ
ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ
ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25
(SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52),
Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID
NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53
(SEQ ID NO:99), Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID NO:113), Figure 66 (SEQ ID NO:115),
Figure 68 (SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), Figure 74 (SEQ
ID NO:129), Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID NO:138), Figure 83 (SEQ ID NO:146), Figure
85 (SEQ ID NO:148), Figure 88 (SEQ ID NO:151), Figure 90 (SEQ ID NO:153), Figure 93 (SEQ ID
NO:156), Figure 95 (SEQ ID NO:158), Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID NO:165), Figure
101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169), Figure 105 (SEQ ID NO:171), Figure 109 (SEQ ID
NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID NO:179), Figure 115 (SEQ ID NO:181), Figure
117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194), Figure 125 (SEQ ID
NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), Figure
133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID
NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure
149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID
NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure
163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID
NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure
178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID
NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure
192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID
NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure
206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID
NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure
220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID
NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure
240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID
NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure
254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID
NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure
268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID
NO:379), Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure
282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID
NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412), Figure
296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID
NO:420), Figure 304 (SEQ ID NO:422), Figure 306 (SEQ ID NO:424), Figure 308 (SEQ ID NO:495), Figure
310 (SEQ ID NO:497), Figure 312 (SEQ ID NO:499), Figure 314 (SEQ ID NO:501), Figure 316 (SEQ ID
NO:503), Figure 318 (SEQ ID NO:505), Figure 320 (SEQ ID NO:507), Figure 322 (SEQ ID NO:509), Figure
324 (SEQ ID NO:511), Figure 326 (SEQ ID NO:513), Figure 328 (SEQ ID NO:515) or Figure 330 (SEQ ID
NO:517), lacking its associated signal peptide.

29. A method of detecting a PRO943 polypeptide in a sample suspected of containing a PRO943
polypeptide, said method comprising contacting said sample with a PRO183, PRO184 or PRO185 polypeptide
and determining the formation of a PRO943/PRO183, PRO184 or PRO185 polypeptide conjugate in said
sample, wherein the formation of said conjugate is indicative of the presence of a PRO943 polypeptide in said
sample.

30. The method according to Claim 29, wherein said sample comprises cells suspected of
expressing said PRO943 polypeptide.

31. The method according to Claim 29, wherein said PRO183, PRO184 or PRO185 polypeptide
is labeled with a detectable label.

32. The method according to Claim 29, wherein said PRO183, PRO184 or PRO185 polypeptide
is attached to a solid support.

33. A method of detecting a PRO183, PRO184 or PRO185 polypeptide in a sample suspected
of containing a PRO183, PRO184 or PRO185 polypeptide, said method comprising contacting said sample with
a PRO943 polypeptide and determining the formation of a PRO943/PRO183, PRO184 or PRO185 polypeptide
conjugate in said sample, wherein the formation of said conjugate is indicative of the presence of a PRO183,
PRO184 or PRO185 polypeptide in said sample.

34. The method according to Claim 33, wherein said sample comprises cells suspected of 
35 expressing said PR0183, PR0184 or PR0185 polypeptide. 
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35. The method according to Claim 33, wherein said PR0943 polypeptide is labeled with a 
detectable label. 

36. The method according to Claim 33, wherein said PR0943 polypeptide is attached to a solid 

support. 

5 

37. A method of detecting a PRO331 polypeptide in a sample suspected of containing a PRO331
polypeptide, said method comprising contacting said sample with a PRO1133 polypeptide and determining the
formation of a PRO331/PRO1133 polypeptide conjugate in said sample, wherein the formation of said
conjugate is indicative of the presence of a PRO331 polypeptide in said sample.

38. The method according to Claim 37, wherein said sample comprises cells suspected of
expressing said PRO331 polypeptide.

39. The method according to Claim 37, wherein said PRO1133 polypeptide is labeled with a
detectable label.

40. The method according to Claim 37, wherein said PRO1133 polypeptide is attached to a solid
support.

41. A method of detecting a PRO1133 polypeptide in a sample suspected of containing a
PRO1133 polypeptide, said method comprising contacting said sample with a PRO331 polypeptide and
determining the formation of a PRO331/PRO1133 polypeptide conjugate in said sample, wherein the formation
of said conjugate is indicative of the presence of a PRO1133 polypeptide in said sample.

42. The method according to Claim 41, wherein said sample comprises cells suspected of
expressing said PRO1133 polypeptide.

43. The method according to Claim 41, wherein said PRO331 polypeptide is labeled with a
detectable label.

44. The method according to Claim 41, wherein said PRO331 polypeptide is attached to a solid
support.

45. A method of detecting a PRO363 or PRO5723 polypeptide in a sample suspected of
containing a PRO363 or PRO5723 polypeptide, said method comprising contacting said sample with a
PRO1387 polypeptide and determining the formation of a PRO363 or PRO5723/PRO1387 polypeptide
conjugate in said sample, wherein the formation of said conjugate is indicative of the presence of a PRO363
or PRO5723 polypeptide in said sample.

46. The method according to Claim 45, wherein said sample comprises cells suspected of
expressing said PRO363 or PRO5723 polypeptide.

47. The method according to Claim 45, wherein said PRO1387 polypeptide is labeled with a
detectable label.

48. The method according to Claim 45, wherein said PRO1387 polypeptide is attached to a solid
support.

49. A method of detecting a PRO1387 polypeptide in a sample suspected of containing a
PRO1387 polypeptide, said method comprising contacting said sample with a PRO363 or PRO5723 polypeptide
and determining the formation of a PRO363 or PRO5723/PRO1387 polypeptide conjugate in said sample,
wherein the formation of said conjugate is indicative of the presence of a PRO1387 polypeptide in said sample.

50. The method according to Claim 49, wherein said sample comprises cells suspected of
expressing said PRO1387 polypeptide.

51. The method according to Claim 49, wherein said PRO363 or PRO5723 polypeptide is labeled
with a detectable label.

52. The method according to Claim 49, wherein said PRO363 or PRO5723 polypeptide is
attached to a solid support.

53. A method of detecting a PRO1114 polypeptide in a sample suspected of containing a
PRO1114 polypeptide, said method comprising contacting said sample with a PRO3301 or PRO9940
polypeptide and determining the formation of a PRO1114/PRO3301 or PRO9940 polypeptide conjugate in said
sample, wherein the formation of said conjugate is indicative of the presence of a PRO1114 polypeptide in said
sample.

54. The method according to Claim 53, wherein said sample comprises cells suspected of
expressing said PRO1114 polypeptide.

55. The method according to Claim 53, wherein said PRO3301 or PRO9940 polypeptide is
labeled with a detectable label.

56. The method according to Claim 53, wherein said PRO3301 or PRO9940 polypeptide is
attached to a solid support.

57. A method of detecting a PRO3301 or PRO9940 polypeptide in a sample suspected of
containing a PRO3301 or PRO9940 polypeptide, said method comprising contacting said sample with a
PRO1114 polypeptide and determining the formation of a PRO3301 or PRO9940/PRO1114 polypeptide
conjugate in said sample, wherein the formation of said conjugate is indicative of the presence of a PRO3301
or PRO9940 polypeptide in said sample.

58. The method according to Claim 57, wherein said sample comprises cells suspected of
expressing said PRO3301 or PRO9940 polypeptide.

59. The method according to Claim 57, wherein said PRO1114 polypeptide is labeled with a
detectable label.

60. The method according to Claim 57, wherein said PRO1114 polypeptide is attached to a solid
support.

61. A method of detecting a PRO1181 polypeptide in a sample suspected of containing a
PRO1181 polypeptide, said method comprising contacting said sample with a PRO7170, PRO361 or PRO846
polypeptide and determining the formation of a PRO1181/PRO7170, PRO361 or PRO846 polypeptide
conjugate in said sample, wherein the formation of said conjugate is indicative of the presence of a PRO1181
polypeptide in said sample.

62. The method according to Claim 61, wherein said sample comprises cells suspected of
expressing said PRO1181 polypeptide.

63. The method according to Claim 61, wherein said PRO7170, PRO361 or PRO846 polypeptide
is labeled with a detectable label.

64. The method according to Claim 61, wherein said PRO7170, PRO361 or PRO846 polypeptide
is attached to a solid support.

65. A method of detecting a PRO7170, PRO361 or PRO846 polypeptide in a sample suspected
of containing a PRO7170, PRO361 or PRO846 polypeptide, said method comprising contacting said sample
with a PRO1181 polypeptide and determining the formation of a PRO1181/PRO7170, PRO361 or PRO846
polypeptide conjugate in said sample, wherein the formation of said conjugate is indicative of the presence of
a PRO7170, PRO361 or PRO846 polypeptide in said sample.

66. The method according to Claim 65, wherein said sample comprises cells suspected of
expressing said PRO7170, PRO361 or PRO846 polypeptide.

67. The method according to Claim 65, wherein said PRO1181 polypeptide is labeled with a
detectable label.

68. The method according to Claim 65, wherein said PRO1181 polypeptide is attached to a solid
support.

69. A method of linking a bioactive molecule to a cell expressing a PRO943 polypeptide, said
method comprising contacting said cell with a PRO183, PRO184 or PRO185 polypeptide that is bound to said
bioactive molecule and allowing said PRO943 and PRO183, PRO184 or PRO185 polypeptides to bind to one
another, thereby linking said bioactive molecules to said cell.

70. The method according to Claim 69, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

71. The method according to Claim 69, wherein said bioactive molecule causes the death of said
cell.

72. A method of linking a bioactive molecule to a cell expressing a PRO183, PRO184 or PRO185
polypeptide, said method comprising contacting said cell with a PRO943 polypeptide that is bound to said
bioactive molecule and allowing said PRO943 and PRO183, PRO184 or PRO185 polypeptides to bind to one
another, thereby linking said bioactive molecules to said cell.

73. The method according to Claim 72, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

74. The method according to Claim 73, wherein said bioactive molecule causes the death of said
cell.

75. A method of linking a bioactive molecule to a cell expressing a PRO331 polypeptide, said
method comprising contacting said cell with a PRO1133 polypeptide that is bound to said bioactive molecule
and allowing said PRO331 and PRO1133 polypeptides to bind to one another, thereby linking said bioactive
molecules to said cell.

76. The method according to Claim 75, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

77. The method according to Claim 75, wherein said bioactive molecule causes the death of said
cell.

78. A method of linking a bioactive molecule to a cell expressing a PRO1133 polypeptide, said
method comprising contacting said cell with a PRO331 polypeptide that is bound to said bioactive molecule
and allowing said PRO331 and PRO1133 polypeptides to bind to one another, thereby linking said bioactive
molecules to said cell.

79. The method according to Claim 78, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

80. The method according to Claim 78, wherein said bioactive molecule causes the death of said
cell.

81. A method of linking a bioactive molecule to a cell expressing a PRO1387 polypeptide, said
method comprising contacting said cell with a PRO363 or PRO5723 polypeptide that is bound to said bioactive
molecule and allowing said PRO1387 and PRO363 or PRO5723 polypeptides to bind to one another, thereby
linking said bioactive molecules to said cell.

82. The method according to Claim 81, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

83. The method according to Claim 81, wherein said bioactive molecule causes the death of said
cell.

84. A method of linking a bioactive molecule to a cell expressing a PRO363 or PRO5723
polypeptide, said method comprising contacting said cell with a PRO1387 polypeptide that is bound to said
bioactive molecule and allowing said PRO1387 and PRO363 or PRO5723 polypeptides to bind to one another,
thereby linking said bioactive molecules to said cell.

85. The method according to Claim 84, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

86. The method according to Claim 84, wherein said bioactive molecule causes the death of said
cell.



WO 00/73454 PCT/US00/08439

87. A method of linking a bioactive molecule to a cell expressing a PRO1114 polypeptide, said
method comprising contacting said cell with a PRO3301 or PRO9940 polypeptide that is bound to said
bioactive molecule and allowing said PRO1114 and PRO3301 or PRO9940 polypeptides to bind to one another,
thereby linking said bioactive molecules to said cell.

88. The method according to Claim 87, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

89. The method according to Claim 87, wherein said bioactive molecule causes the death of said
cell.

90. A method of linking a bioactive molecule to a cell expressing a PRO3301 or PRO9940
polypeptide, said method comprising contacting said cell with a PRO1114 polypeptide that is bound to said
bioactive molecule and allowing said PRO1114 and PRO3301 or PRO9940 polypeptides to bind to one another,
thereby linking said bioactive molecules to said cell.

91. The method according to Claim 90, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

92. The method according to Claim 90, wherein said bioactive molecule causes the death of said
cell.

93. A method of linking a bioactive molecule to a cell expressing a PRO1181 polypeptide, said
method comprising contacting said cell with a PRO7170, PRO361 or PRO846 polypeptide that is bound to said
bioactive molecule and allowing said PRO1181 and PRO7170, PRO361 or PRO846 polypeptides to bind to
one another, thereby linking said bioactive molecules to said cell.

94. The method according to Claim 93, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

95. The method according to Claim 93, wherein said bioactive molecule causes the death of said
cell.

96. A method of linking a bioactive molecule to a cell expressing a PRO7170, PRO361 or
PRO846 polypeptide, said method comprising contacting said cell with a PRO1181 polypeptide that is bound
to said bioactive molecule and allowing said PRO1181 and PRO7170, PRO361 or PRO846 polypeptides to
bind to one another, thereby linking said bioactive molecules to said cell.

97. The method according to Claim 96, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.

98. The method according to Claim 96, wherein said bioactive molecule causes the death of said
cell.

99. A method of modulating at least one biological activity of a cell expressing a PRO943
polypeptide, said method comprising contacting said cell with a PRO183, PRO184 or PRO185 polypeptide or
an anti-PRO943 antibody, whereby said PRO183, PRO184 or PRO185 polypeptide or said anti-PRO943
antibody binds to said PRO943 polypeptide, thereby modulating at least one biological activity of said cell.

100. The method according to Claim 99, wherein said cell is killed.

101. A method of modulating at least one biological activity of a cell expressing a PRO183,
PRO184 or PRO185 polypeptide, said method comprising contacting said cell with a PRO943 polypeptide or
an anti-PRO183, anti-PRO184 or anti-PRO185 antibody, whereby said PRO943 polypeptide or said anti-PRO183,
anti-PRO184 or anti-PRO185 antibody binds to said PRO183, PRO184 or PRO185 polypeptide,
thereby modulating at least one biological activity of said cell.

102. The method according to Claim 101, wherein said cell is killed.

103. A method of modulating at least one biological activity of a cell expressing a PRO331
polypeptide, said method comprising contacting said cell with a PRO1133 polypeptide or an anti-PRO331
antibody, whereby said PRO1133 polypeptide or said anti-PRO331 antibody binds to said PRO331 polypeptide,
thereby modulating at least one biological activity of said cell.

104. The method according to Claim 103, wherein said cell is killed.

105. A method of modulating at least one biological activity of a cell expressing a PRO1133
polypeptide, said method comprising contacting said cell with a PRO331 polypeptide or an anti-PRO1133
antibody, whereby said PRO331 polypeptide or said anti-PRO1133 antibody binds to said PRO1133
polypeptide, thereby modulating at least one biological activity of said cell.

106. The method according to Claim 105, wherein said cell is killed.

107. A method of modulating at least one biological activity of a cell expressing a PRO1387
polypeptide, said method comprising contacting said cell with a PRO363 or PRO5723 polypeptide or an
anti-PRO1387 antibody, whereby said PRO363 or PRO5723 polypeptide or said anti-PRO1387 antibody binds to
said PRO1387 polypeptide, thereby modulating at least one biological activity of said cell.

108. The method according to Claim 107, wherein said cell is killed.

109. A method of modulating at least one biological activity of a cell expressing a PRO363 or
PRO5723 polypeptide, said method comprising contacting said cell with a PRO1387 polypeptide or an
anti-PRO363 or anti-PRO5723 antibody, whereby said PRO1387 polypeptide or said anti-PRO363 or anti-PRO5723
antibody binds to said PRO363 or PRO5723 polypeptide, thereby modulating at least one biological activity
of said cell.

110. The method according to Claim 109, wherein said cell is killed.

111. A method of modulating at least one biological activity of a cell expressing a PRO1114
polypeptide, said method comprising contacting said cell with a PRO3301 or PRO9940 polypeptide or an
anti-PRO1114 antibody, whereby said PRO3301 or PRO9940 polypeptide or said anti-PRO1114 antibody binds
to said PRO1114 polypeptide, thereby modulating at least one biological activity of said cell.

112. The method according to Claim 111, wherein said cell is killed.

113. A method of modulating at least one biological activity of a cell expressing a PRO3301 or
PRO9940 polypeptide, said method comprising contacting said cell with a PRO1114 polypeptide or an
anti-PRO3301 or anti-PRO9940 antibody, whereby said PRO1114 polypeptide or said anti-PRO3301 or
anti-PRO9940 antibody binds to said PRO3301 or PRO9940 polypeptide, thereby modulating at least one biological
activity of said cell.

114. The method according to Claim 113, wherein said cell is killed.

115. A method of modulating at least one biological activity of a cell expressing a PRO1181
polypeptide, said method comprising contacting said cell with a PRO7170, PRO361 or PRO846 polypeptide
or an anti-PRO1181 antibody, whereby said PRO7170, PRO361 or PRO846 polypeptide or said anti-PRO1181
antibody binds to said PRO1181 polypeptide, thereby modulating at least one biological activity of said cell.

116. The method according to Claim 115, wherein said cell is killed.

117. A method of modulating at least one biological activity of a cell expressing a PRO7170,
PRO361 or PRO846 polypeptide, said method comprising contacting said cell with a PRO1181 polypeptide
or an anti-PRO7170, anti-PRO361 or anti-PRO846 antibody, whereby said PRO1181 polypeptide or said
anti-PRO7170, anti-PRO361 or anti-PRO846 antibody binds to said PRO7170, PRO361 or PRO846 polypeptide,
thereby modulating at least one biological activity of said cell.

118. The method according to Claim 117, wherein said cell is killed.

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PRO1384 polypeptide having the
sequence of amino acid residues from about 1 to about 229, inclusive of Figure 306 (SEQ ID NO:424), or (b)
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell
comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii)
recovering the polypeptide from the cell culture.

In yet another embodiment, the invention concerns agonists and antagonists of a native PRO1384
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PRO1384 antibody.

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of
a native PRO1384 polypeptide, by contacting the native PRO1384 polypeptide with a candidate molecule and
monitoring a biological activity mediated by said polypeptide.

In a still further embodiment, the invention concerns a composition comprising a PRO1384
polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically
acceptable carrier.

136. Additional Embodiments 

In other embodiments of the present invention, the invention provides vectors comprising DNA
encoding any of the herein described polypeptides. Host cells comprising any such vector are also provided.
By way of example, the host cells may be CHO cells, E. coli, or yeast. A process for producing any of the
herein described polypeptides is further provided and comprises culturing host cells under conditions suitable
for expression of the desired polypeptide and recovering the desired polypeptide from the cell culture.

In other embodiments, the invention provides chimeric molecules comprising any of the herein
described polypeptides fused to a heterologous polypeptide or amino acid sequence. Examples of such chimeric
molecules comprise any of the herein described polypeptides fused to an epitope tag sequence or an Fc region
of an immunoglobulin.

In another embodiment, the invention provides an antibody which specifically binds to any of the
above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized
antibody, antibody fragment or single-chain antibody.

In yet other embodiments, the invention provides oligonucleotide probes useful for isolating genomic
and cDNA nucleotide sequences or as antisense probes, wherein those probes may be derived from any of the
above or below described nucleotide sequences.

In other embodiments, the invention provides an isolated nucleic acid molecule comprising a
nucleotide sequence that encodes a PRO polypeptide.

In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about
80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82%
sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about
84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least
about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably
at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more
preferably at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet
more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence
identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95%
sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about
97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at
least about 99% sequence identity to (a) a DNA molecule encoding a PRO polypeptide having a full-length
amino acid sequence as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein,
an extracellular domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or
any other specifically defined fragment of the full-length amino acid sequence as disclosed herein, or (b) the
complement of the DNA molecule of (a).
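
As a purely illustrative aid to the percent-identity thresholds recited above, the following Python sketch computes a simple percent identity over two sequences that are assumed to be already aligned (equal length, with "-" marking a gap). The function names, the gap convention and the choice of denominator are assumptions made for this example only; the specification's own definition of percent sequence identity, given elsewhere in the document, governs and may differ.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of aligned columns in which both sequences carry the same residue.

    Assumption: the inputs are pre-aligned strings of equal length, '-' marks a gap.
    Columns where both sequences have a gap are ignored; every other column counts
    toward the denominator, and a residue opposite a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True when the simple percent identity is at least the given threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For instance, two aligned ten-residue sequences that differ at one position would pass an 80% threshold under this simplified measure but fail a 95% threshold.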

In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least
about 80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about
82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least
about 84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably
at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more
preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet
more preferably at least about 90% sequence identity, yet more preferably at least about 91% sequence
identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93%
sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about
95% sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least
about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably
at least about 99% sequence identity to (a) a DNA molecule comprising the coding sequence of a full-length
PRO polypeptide cDNA as disclosed herein, the coding sequence of a PRO polypeptide lacking the signal
peptide as disclosed herein, the coding sequence of an extracellular domain of a transmembrane PRO
polypeptide, with or without the signal peptide, as disclosed herein or the coding sequence of any other
specifically defined fragment of the full-length amino acid sequence as disclosed herein, or (b) the complement
of the DNA molecule of (a).
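
Because several of the foregoing aspects refer to "the complement of the DNA molecule of (a)", a brief sketch of how a complementary strand may be derived from a given strand is offered here for illustration only. It assumes an unambiguous A/C/G/T alphabet and returns the complementary strand written 5' to 3' (the reverse complement); the function name and the error handling are hypothetical choices for this example.

```python
# Watson-Crick pairing table for an unambiguous DNA alphabet (assumption: no
# IUPAC ambiguity codes such as N or R appear in the input).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand of `dna`, read 5' to 3'."""
    unexpected = set(dna) - set("ACGTacgt")
    if unexpected:
        raise ValueError(f"unsupported characters: {sorted(unexpected)}")
    # Complement each base, then reverse so the result is also written 5' to 3'.
    return dna.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the original strand, which is a convenient sanity check when preparing probe sequences.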

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide
sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more
preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet
more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence
identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87%
sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about
89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least
about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably
at least about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more
preferably at least about 95% sequence identity, yet more preferably at least about 96% sequence identity, yet
more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity
and yet more preferably at least about 99% sequence identity to (a) a DNA molecule that encodes the same
mature polypeptide encoded by any of the human protein cDNAs deposited with the ATCC as disclosed herein,
or (b) the complement of the DNA molecule of (a).

In another aspect, the invention provides an isolated nucleic acid molecule comprising a nucleotide
sequence encoding a PRO polypeptide which is either transmembrane domain-deleted or transmembrane 
domain-inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane 
domain(s) of such polypeptide are disclosed herein. Therefore, soluble extracellular domains of the herein 
described PRO polypeptides are contemplated. 

Another embodiment is directed to fragments of a PRO polypeptide coding sequence, or the
complement thereof, that may find use as, for example, hybridization probes, for encoding fragments of a PRO 
polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PRO antibody or 
as antisense oligonucleotide probes. Such nucleic acid fragments are usually at least about 20 nucleotides in
length, preferably at least about 30 nucleotides in length, more preferably at least about 40 nucleotides in 
length, yet more preferably at least about 50 nucleotides in length, yet more preferably at least about 60 
nucleotides in length, yet more preferably at least about 70 nucleotides in length, yet more preferably at least 
about 80 nucleotides in length, yet more preferably at least about 90 nucleotides in length, yet more preferably 
at least about 100 nucleotides in length, yet more preferably at least about 110 nucleotides in length, yet more 
preferably at least about 120 nucleotides in length, yet more preferably at least about 130 nucleotides in length, 
yet more preferably at least about 140 nucleotides in length, yet more preferably at least about 150 nucleotides 
in length, yet more preferably at least about 160 nucleotides in length, yet more preferably at least about 170 
nucleotides in length, yet more preferably at least about 180 nucleotides in length, yet more preferably at least 
about 190 nucleotides in length, yet more preferably at least about 200 nucleotides in length, yet more 
preferably at least about 250 nucleotides in length, yet more preferably at least about 300 nucleotides in length, 
yet more preferably at least about 350 nucleotides in length, yet more preferably at least about 400 nucleotides 
in length, yet more preferably at least about 450 nucleotides in length, yet more preferably at least about 500 
nucleotides in length, yet more preferably at least about 600 nucleotides in length, yet more preferably at least 
about 700 nucleotides in length, yet more preferably at least about 800 nucleotides in length, yet more 
preferably at least about 900 nucleotides in length and yet more preferably at least about 1000 nucleotides in 
length, wherein in this context the term "about" means the referenced nucleotide sequence length plus or minus
10% of that referenced length. It is noted that novel fragments of a PRO polypeptide-encoding nucleotide 
sequence may be determined in a routine manner by aligning the PRO polypeptide-encoding nucleotide 
sequence with other known nucleotide sequences using any of a number of well known sequence alignment 
programs and determining which PRO polypeptide-encoding nucleotide sequence fragment(s) are novel. All 
of such PRO polypeptide-encoding nucleotide sequences are contemplated herein. Also contemplated are the 
PRO polypeptide fragments encoded by these nucleotide molecule fragments, preferably those PRO polypeptide
fragments that comprise a binding site for an anti-PRO antibody.

In another embodiment, the invention provides an isolated PRO polypeptide encoded by any of the
isolated nucleic acid sequences hereinabove identified. 
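
The definition of "about" given above for nucleic acid fragment lengths (the referenced length plus or minus 10% of that length) can be illustrated with a short calculation. The Python sketch below is offered only as an example of that arithmetic; the function names and the default parameter are hypothetical and form no part of the disclosure.

```python
from typing import Tuple


def about_length_range(referenced_length: int, tolerance_percent: int = 10) -> Tuple[float, float]:
    """Return the (lower, upper) bounds covered by 'about' a referenced length.

    Implements the stated rule: referenced length plus or minus 10% of that length.
    """
    if referenced_length <= 0:
        raise ValueError("referenced length must be positive")
    delta = referenced_length * tolerance_percent / 100
    return (referenced_length - delta, referenced_length + delta)


def is_about(length: int, referenced_length: int) -> bool:
    """True when `length` lies within 10% of `referenced_length`, bounds included."""
    lower, upper = about_length_range(referenced_length)
    return lower <= length <= upper
```

Under this reading, "about 20 nucleotides" spans 18 to 22 nucleotides, and "about 1000 nucleotides" spans 900 to 1100 nucleotides.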

In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more 
preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet 
more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence 
identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% 
sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 
89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least 
about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably
at least about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more 
preferably at least about 95% sequence identity, yet more preferably at least about 96% sequence identity, yet 
more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity
and yet more preferably at least about 99% sequence identity to a PRO polypeptide having a full-length amino
acid sequence as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an 
extracellular domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any
other specifically defined fragment of the full-length amino acid sequence as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid
sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more
preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet 
more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence 
identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% 
sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 
89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least 
about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably 
at least about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more 
preferably at least about 95% sequence identity, yet more preferably at least about 96% sequence identity, yet 
more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity 
and yet more preferably at least about 99% sequence identity to an amino acid sequence encoded by any of the 
human protein cDNAs deposited with the ATCC as disclosed herein. 
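
By way of illustration only, the percent sequence identity thresholds recited above can be checked with a short calculation. The sketch below is not the alignment method of this disclosure; it assumes the two amino acid sequences have already been aligned by some external program (gaps written as "-"), and it assumes that identity is expressed as the number of identical aligned residues divided by the number of residues in the reference sequence. The function names and the example sequences are hypothetical.

```python
def percent_identity(aligned_query: str, aligned_reference: str, gap: str = "-") -> float:
    """Percent identity of a query to a reference, given a precomputed alignment.

    Assumption: the denominator is the number of residues (non-gap characters)
    in the reference sequence.
    """
    if len(aligned_query) != len(aligned_reference):
        raise ValueError("aligned sequences must have the same length")
    # Count the residues of the reference, ignoring gap characters.
    reference_length = sum(1 for r in aligned_reference if r != gap)
    if reference_length == 0:
        raise ValueError("reference sequence contains no residues")
    # An identical match is a column where both sequences carry the same residue.
    matches = sum(
        1 for q, r in zip(aligned_query, aligned_reference) if q == r and r != gap
    )
    return 100.0 * matches / reference_length


def meets_threshold(aligned_query: str, aligned_reference: str, threshold: float = 80.0) -> bool:
    """True if the query reaches at least `threshold` percent identity."""
    return percent_identity(aligned_query, aligned_reference) >= threshold


if __name__ == "__main__":
    # Hypothetical 7-residue reference and a query missing one residue.
    print(f"{percent_identity('MKT-LLV', 'MKTALLV'):.1f}")  # 85.7
    print(meets_threshold("MKT-LLV", "MKTALLV", 80.0))      # True
```

In this hypothetical example the query matches 6 of the 7 reference residues, so it would satisfy the "at least about 80%" and "at least about 85%" thresholds but not the "at least about 86%" threshold.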

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 81% positives, more preferably at least 
about 82% positives, yet more preferably at least about 83% positives, yet more preferably at least about 84% 
positives, yet more preferably at least about 85% positives, yet more preferably at least about 86% positives, 
yet more preferably at least about 87% positives, yet more preferably at least about 88% positives, yet more 
preferably at least about 89% positives, yet more preferably at least about 90% positives, yet more preferably 
at least about 91% positives, yet more preferably at least about 92% positives, yet more preferably at least 
about 93% positives, yet more preferably at least about 94% positives, yet more preferably at least about 95% 
positives, yet more preferably at least about 96% positives, yet more preferably at least about 97% positives, 
yet more preferably at least about 98% positives and yet more preferably at least about 99% positives when 
compared with the amino acid sequence of a PRO polypeptide having a full-length amino acid sequence as 
disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain 
of a transmembrane protein, with or without the signal peptide, as disclosed herein or any other specifically 
defined fragment of the full-length amino acid sequence as disclosed herein. 

In a specific aspect, the invention provides an isolated PRO polypeptide which lacks the N-terminal signal 
sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino 
acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein 
those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding 
nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and recovering the PRO 
polypeptide from the cell culture. 

Another aspect of the invention provides an isolated PRO polypeptide which is either transmembrane 
domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide 
and recovering the PRO polypeptide from the cell culture. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 
polypeptide as defined herein. In a particular embodiment, the agonist or antagonist is an anti-PRO antibody 
or a small molecule. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a 
PRO polypeptide which comprises contacting the PRO polypeptide with a candidate molecule and monitoring 
a biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native PRO 
polypeptide. 

In a still further embodiment, the invention concerns a composition of matter comprising a PRO 
polypeptide, or an agonist or antagonist of a PRO polypeptide as herein described, or an anti-PRO antibody, 
in combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier. 

Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an 
agonist or antagonist thereof as hereinbefore described, or an anti-PRO antibody, for the preparation of a 
medicament useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or 
antagonist thereof or an anti-PRO antibody. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a nucleotide sequence (SEQ ID NO:1) of a native sequence PRO281 (UNQ244) 
cDNA, wherein SEQ ID NO:1 is a clone designated herein as "DNA16422-1209". 

Figure 2 shows the amino acid sequence (SEQ ID NO:2) derived from the coding sequence of SEQ 
ID NO:1 shown in Figure 1. 

Figure 3 shows a nucleotide sequence (SEQ ID NO:5) of a native sequence PRO276 (UNQ243) 
cDNA, wherein SEQ ID NO:5 is a clone designated herein as "DNA16435-1208". 

Figure 4 shows the amino acid sequence (SEQ ID NO:6) derived from the coding sequence of SEQ 
ID NO:5 shown in Figure 3. 

Figure 5 shows a nucleotide sequence (SEQ ID NO:7) of a native sequence PRO189 (UNQ163) 
cDNA, wherein SEQ ID NO:7 is a clone designated herein as "DNA21624-1391". 

Figure 6 shows the amino acid sequence (SEQ ID NO:8) derived from the coding sequence of SEQ 
ID NO:7 shown in Figure 5. 

Figure 7 shows a nucleotide sequence designated herein as DNA14187 (SEQ ID NO:9). 

Figure 8 shows a nucleotide sequence (SEQ ID NO:13) of a native sequence PRO190 (UNQ164) 
cDNA, wherein SEQ ID NO:13 is a clone designated herein as "DNA23334-1392". 

Figure 9 shows the amino acid sequence (SEQ ID NO:14) derived from the coding sequence of SEQ 
ID NO:13 shown in Figure 8. 

Figure 10 shows a nucleotide sequence designated herein as DNA14232 (SEQ ID NO:15). 

Figure 11 shows a nucleotide sequence (SEQ ID NO:19) of a native sequence PRO341 (UNQ300) 
cDNA, wherein SEQ ID NO:19 is a clone designated herein as "DNA26288-1239". 

Figure 12 shows the amino acid sequence (SEQ ID NO:20) derived from the coding sequence of SEQ 
ID NO:19 shown in Figure 11. 

Figure 13 shows a nucleotide sequence designated herein as DNA12920 (SEQ ID NO:21). 

Figure 14 shows a nucleotide sequence (SEQ ID NO:22) of a native sequence PRO180 (UNQ154) 
cDNA, wherein SEQ ID NO:22 is a clone designated herein as "DNA26843-1389". 

Figure 15 shows the amino acid sequence (SEQ ID NO:23) derived from the coding sequence of SEQ 
ID NO:22 shown in Figure 14. 

Figure 16 shows a nucleotide sequence designated herein as DNA12922 (SEQ ID NO:24). 

Figure 17 shows a nucleotide sequence (SEQ ID NO:27) of a native sequence PRO194 (UNQ168) 
cDNA, wherein SEQ ID NO:27 is a clone designated herein as "DNA26844-1394". 

Figure 18 shows the amino acid sequence (SEQ ID NO:28) derived from the coding sequence of SEQ 
ID NO:27 shown in Figure 17. 

Figure 19 shows a nucleotide sequence (SEQ ID NO:29) of a native sequence PRO203 (UNQ177) 
cDNA, wherein SEQ ID NO:29 is a clone designated herein as "DNA30862-1396". 

Figure 20 shows the amino acid sequence (SEQ ID NO:30) derived from the coding sequence of SEQ 
ID NO:29 shown in Figure 19. 

Figure 21 shows a nucleotide sequence designated herein as DNA15618 (SEQ ID NO:31). 

Figure 22 shows a nucleotide sequence (SEQ ID NO:32) of a native sequence PRO290 (UNQ253) 
cDNA, wherein SEQ ID NO:32 is a clone designated herein as "DNA35680-1212". 

Figure 23 shows the amino acid sequence (SEQ ID NO:33) derived from the coding sequence of SEQ 
ID NO:32 shown in Figure 22. 

Figure 24 shows a nucleotide sequence (SEQ ID NO:35) of a native sequence PRO874 (UNQ441) 
cDNA, wherein SEQ ID NO:35 is a clone designated herein as "DNA40621-1440". 

Figure 25 shows the amino acid sequence (SEQ ID NO: 36) derived from the coding sequence of SEQ 
ID NO:35 shown in Figure 24. 

Figure 26 shows a nucleotide sequence (SEQ ID NO:40) of a native sequence PRO710 (UNQ374) 
cDNA, wherein SEQ ID NO:40 is a clone designated herein as "DNA44161-1434". 

Figure 27 shows the amino acid sequence (SEQ ID NO:41) derived from the coding sequence of SEQ 
ID NO:40 shown in Figure 26. 

Figure 28 shows a nucleotide sequence designated herein as DNA38190 (SEQ ID NO:42). 

Figure 29 shows a nucleotide sequence (SEQ ID NO:46) of a native sequence PRO1151 (UNQ581) 
cDNA, wherein SEQ ID NO:46 is a clone designated herein as "DNA44694-1500". 

Figure 30 shows the amino acid sequence (SEQ ID NO:47) derived from the coding sequence of SEQ 
ID NO:46 shown in Figure 29. 

Figure 31 shows a nucleotide sequence (SEQ ID NO:51) of a native sequence PRO1282 (UNQ652) 
cDNA, wherein SEQ ID NO:51 is a clone designated herein as "DNA45495-1550". 

Figure 32 shows the amino acid sequence (SEQ ID NO:52) derived from the coding sequence of SEQ 
ID NO:51 shown in Figure 31. 

Figure 33 shows a nucleotide sequence (SEQ ID NO:56) of a native sequence PRO358 cDNA, 
wherein SEQ ID NO:56 is a clone designated herein as "DNA47361-1154". 

Figure 34 shows the amino acid sequence (SEQ ID NO:57) derived from the coding sequence of SEQ 
ID NO:56 shown in Figure 33. 

Figure 35 shows a nucleotide sequence (SEQ ID NO:61) of a native sequence PRO1310 cDNA, 
wherein SEQ ID NO:61 is a clone designated herein as "DNA47394-1572". 

Figure 36 shows the amino acid sequence (SEQ ID NO:62) derived from the coding sequence of SEQ 
ID NO:61 shown in Figure 35. 

Figure 37 shows a nucleotide sequence (SEQ ID NO:66) of a native sequence PRO698 (UNQ362) 
cDNA, wherein SEQ ID NO:66 is a clone designated herein as "DNA48320-1433". 

Figure 38 shows the amino acid sequence (SEQ ID NO:67) derived from the coding sequence of SEQ 
ID NO:66 shown in Figure 37. 

Figure 39 shows a nucleotide sequence designated herein as DNA39906 (SEQ ID NO:68). 

Figure 40 shows a nucleotide sequence (SEQ ID NO:72) of a native sequence PRO732 (UNQ396) 
cDNA, wherein SEQ ID NO:72 is a clone designated herein as "DNA48334-1435". 

Figure 41 shows the amino acid sequence (SEQ ID NO:73) derived from the coding sequence of SEQ 
ID NO:72 shown in Figure 40. 

Figure 42 shows a nucleotide sequence designated herein as DNA20239 (SEQ ID NO:74). 

Figure 43 shows a nucleotide sequence designated herein as DNA38050 (SEQ ID NO:75). 

Figure 44 shows a nucleotide sequence designated herein as DNA40683 (SEQ ID NO:76). 

Figure 45 shows a nucleotide sequence designated herein as DNA42580 (SEQ ID NO:77). 

Figure 46 shows a nucleotide sequence (SEQ ID NO:83) of a native sequence PRO1120 (UNQ559) 
cDNA, wherein SEQ ID NO:83 is a clone designated herein as "DNA48606-1479". 

Figure 47 shows the amino acid sequence (SEQ ID NO:84) derived from the coding sequence of SEQ 
ID NO:83 shown in Figure 46. 

Figure 48 shows a nucleotide sequence (SEQ ID NO:94) of a native sequence PRO537 (UNQ338) 
cDNA, wherein SEQ ID NO:94 is a clone designated herein as "DNA49141-1431". 

Figure 49 shows the amino acid sequence (SEQ ID NO:95) derived from the coding sequence of SEQ 
ID NO:94 shown in Figure 48. 

Figure 50 shows a nucleotide sequence (SEQ ID NO:96) of a native sequence PRO536 (UNQ337) 
cDNA, wherein SEQ ID NO:96 is a clone designated herein as "DNA49142-1430". 

Figure 51 shows the amino acid sequence (SEQ ID NO:97) derived from the coding sequence of SEQ 
ID NO:96 shown in Figure 50. 

Figure 52 shows a nucleotide sequence (SEQ ID NO:98) of a native sequence PRO535 (UNQ336) 
cDNA, wherein SEQ ID NO:98 is a clone designated herein as "DNA49143-1429". 

Figure 53 shows the amino acid sequence (SEQ ID NO:99) derived from the coding sequence of SEQ 
ID NO:98 shown in Figure 52. 

Figure 54 shows a nucleotide sequence designated herein as DNA30861 (SEQ ID NO:100). 

Figure 55 shows a nucleotide sequence designated herein as DNA36351 (SEQ ID NO:101). 

Figure 56 shows a nucleotide sequence (SEQ ID NO:102) of a native sequence PRO718 (UNQ386) 
cDNA, wherein SEQ ID NO:102 is a clone designated herein as "DNA49647-1398". 

Figure 57 shows the amino acid sequence (SEQ ID NO:103) derived from the coding sequence of SEQ 
ID NO:102 shown in Figure 56. 

Figure 58 shows a nucleotide sequence designated herein as DNA15386 (SEQ ID NO:104). 

Figure 59 shows a nucleotide sequence designated herein as DNA16630 (SEQ ID NO:105). 

Figure 60 shows a nucleotide sequence designated herein as DNA16829 (SEQ ID NO:106). 

Figure 61 shows a nucleotide sequence designated herein as DNA28357 (SEQ ID NO:107). 

Figure 62 shows a nucleotide sequence designated herein as DNA43512 (SEQ ID NO:108). 

Figure 63 shows a nucleotide sequence (SEQ ID NO:112) of a native sequence PRO872 (UNQ439) 
cDNA, wherein SEQ ID NO:112 is a clone designated herein as "DNA49819-1439". 

Figure 64 shows the amino acid sequence (SEQ ID NO:113) derived from the coding sequence of SEQ 
ID NO:112 shown in Figure 63. 

Figure 65 shows a nucleotide sequence (SEQ ID NO:114) of a native sequence PRO1063 (UNQ128) 
cDNA, wherein SEQ ID NO:114 is a clone designated herein as "DNA49820-1427". 

Figure 66 shows the amino acid sequence (SEQ ID NO:115) derived from the coding sequence of SEQ 
ID NO:114 shown in Figure 65. 

Figure 67 shows a nucleotide sequence (SEQ ID NO:116) of a native sequence PRO619 (UNQ355) 
cDNA, wherein SEQ ID NO:116 is a clone designated herein as "DNA49821-1562". 

Figure 68 shows the amino acid sequence (SEQ ID NO:117) derived from the coding sequence of SEQ 
ID NO:116 shown in Figure 67. 

Figure 69 shows a nucleotide sequence (SEQ ID NO:118) of a native sequence PRO943 (UNQ480) 
cDNA, wherein SEQ ID NO:118 is a clone designated herein as "DNA52192-1369". 

Figure 70 shows the amino acid sequence (SEQ ID NO:119) derived from the coding sequence of SEQ 
ID NO:118 shown in Figure 69. 

Figure 71 shows a nucleotide sequence (SEQ ID NO:123) of a native sequence PRO1188 (UNQ602) 
cDNA, wherein SEQ ID NO:123 is a clone designated herein as "DNA52598-1518". 

Figure 72 shows the amino acid sequence (SEQ ID NO:124) derived from the coding sequence of SEQ 
ID NO:123 shown in Figure 71. 

Figure 73 shows a nucleotide sequence (SEQ ID NO:128) of a native sequence PRO1133 (UNQ571) 
cDNA, wherein SEQ ID NO:128 is a clone designated herein as "DNA53913-1490". 

Figure 74 shows the amino acid sequence (SEQ ID NO:129) derived from the coding sequence of SEQ 
ID NO:128 shown in Figure 73. 

Figure 75 shows a nucleotide sequence (SEQ ID NO:134) of a native sequence PRO784 (UNQ459) 
cDNA, wherein SEQ ID NO:134 is a clone designated herein as "DNA53978-1443". 

Figure 76 shows the amino acid sequence (SEQ ID NO:135) derived from the coding sequence of SEQ 
ID NO:134 shown in Figure 75. 

Figure 77 shows a nucleotide sequence designated herein as DNA44661 (SEQ ID NO:136). 

Figure 78 shows a nucleotide sequence (SEQ ID NO:137) of a native sequence PRO783 (UNQ458) 
cDNA, wherein SEQ ID NO:137 is a clone designated herein as "DNA53996-1442". 

Figure 79 shows the amino acid sequence (SEQ ID NO:138) derived from the coding sequence of SEQ 
ID NO:137 shown in Figure 78. 

Figure 80 shows a nucleotide sequence designated herein as DNA45201 (SEQ ID NO:139). 

Figure 81 shows a nucleotide sequence designated herein as DNA14575 (SEQ ID NO:140). 

Figure 82 shows a nucleotide sequence (SEQ ID NO:145) of a native sequence PRO820 (UNQ503) 
cDNA, wherein SEQ ID NO:145 is a clone designated herein as "DNA56041-1416". 

Figure 83 shows the amino acid sequence (SEQ ID NO:146) derived from the coding sequence of SEQ 
ID NO:145 shown in Figure 82. 

Figure 84 shows a nucleotide sequence (SEQ ID NO:147) of a native sequence PRO1080 (UNQ537) 
cDNA, wherein SEQ ID NO:147 is a clone designated herein as "DNA56047-1456". 

Figure 85 shows the amino acid sequence (SEQ ID NO:148) derived from the coding sequence of SEQ 
ID NO:147 shown in Figure 84. 

Figure 86 shows a nucleotide sequence designated herein as DNA36527 (SEQ ID NO:149). 

Figure 87 shows a nucleotide sequence (SEQ ID NO:150) of a native sequence PRO1079 (UNQ536) 
cDNA, wherein SEQ ID NO:150 is a clone designated herein as "DNA56050-1455". 

Figure 88 shows the amino acid sequence (SEQ ID NO:151) derived from the coding sequence of SEQ 
ID NO:150 shown in Figure 87. 

Figure 89 shows a nucleotide sequence (SEQ ID NO:152) of a native sequence PRO793 (UNQ432) 
cDNA, wherein SEQ ID NO:152 is a clone designated herein as "DNA56110-1437". 

Figure 90 shows the amino acid sequence (SEQ ID NO:153) derived from the coding sequence of SEQ 
ID NO:152 shown in Figure 89. 

Figure 91 shows a nucleotide sequence designated herein as DNA50177 (SEQ ID NO:154). 

Figure 92 shows a nucleotide sequence (SEQ ID NO:155) of a native sequence PRO1016 (UNQ499) 
cDNA, wherein SEQ ID NO:155 is a clone designated herein as "DNA56113-1378". 

Figure 93 shows the amino acid sequence (SEQ ID NO:156) derived from the coding sequence of SEQ 
ID NO:155 shown in Figure 92. 

Figure 94 shows a nucleotide sequence (SEQ ID NO:157) of a native sequence PRO1013 (UNQ496) 
cDNA, wherein SEQ ID NO:157 is a clone designated herein as "DNA56410-1414". 

Figure 95 shows the amino acid sequence (SEQ ID NO:158) derived from the coding sequence of SEQ 
ID NO:157 shown in Figure 94. 

Figure 96 shows a nucleotide sequence (SEQ ID NO:159) of a native sequence PRO937 (UNQ474) 
cDNA, wherein SEQ ID NO:159 is a clone designated herein as "DNA56436-1448". 

Figure 97 shows the amino acid sequence (SEQ ID NO:160) derived from the coding sequence of SEQ 
ID NO:159 shown in Figure 96. 

Figure 98 shows a nucleotide sequence (SEQ ID NO:164) of a native sequence PRO842 (UNQ473) 
cDNA, wherein SEQ ID NO:164 is a clone designated herein as "DNA56855-1447". 

Figure 99 shows the amino acid sequence (SEQ ID NO:165) derived from the coding sequence of SEQ 
ID NO:164 shown in Figure 98. 

Figure 100 shows a nucleotide sequence (SEQ ID NO:166) of a native sequence PRO839 (UNQ472) 
cDNA, wherein SEQ ID NO:166 is a clone designated herein as "DNA56859-1445". 

Figure 101 shows the amino acid sequence (SEQ ID NO:167) derived from the coding sequence of 
SEQ ID NO:166 shown in Figure 100. 

Figure 102 shows a nucleotide sequence (SEQ ID NO:168) of a native sequence PRO1180 (UNQ594) 
cDNA, wherein SEQ ID NO:168 is a clone designated herein as "DNA56860-1510". 

Figure 103 shows the amino acid sequence (SEQ ID NO:169) derived from the coding sequence of 
SEQ ID NO:168 shown in Figure 102. 

Figure 104 shows a nucleotide sequence (SEQ ID NO:170) of a native sequence PRO1134 (UNQ572) 
cDNA, wherein SEQ ID NO:170 is a clone designated herein as "DNA56865-1491". 

Figure 105 shows the amino acid sequence (SEQ ID NO:171) derived from the coding sequence of 
SEQ ID NO:170 shown in Figure 104. 

Figure 106 shows a nucleotide sequence designated herein as DNA52352 (SEQ ID NO:172). 

Figure 107 shows a nucleotide sequence designated herein as DNA55725 (SEQ ID NO:173). 

Figure 108 shows a nucleotide sequence (SEQ ID NO:174) of a native sequence PRO830 (UNQ470) 
cDNA, wherein SEQ ID NO:174 is a clone designated herein as "DNA56866-1342". 

Figure 109 shows the amino acid sequence (SEQ ID NO:175) derived from the coding sequence of 
SEQ ID NO:174 shown in Figure 108. 

Figure 110 shows a nucleotide sequence (SEQ ID NO:176) of a native sequence PRO1115 (UNQ558) 
cDNA, wherein SEQ ID NO:176 is a clone designated herein as "DNA56868-1478". 

Figure 111 shows the amino acid sequence (SEQ ID NO:177) derived from the coding sequence of 
SEQ ID NO:176 shown in Figure 110. 

Figure 112 shows a nucleotide sequence (SEQ ID NO:178) of a native sequence PRO1277 (UNQ647) 
cDNA, wherein SEQ ID NO:178 is a clone designated herein as "DNA56869-1545". 

Figure 113 shows the amino acid sequence (SEQ ID NO:179) derived from the coding sequence of 
SEQ ID NO:178 shown in Figure 112. 

Figure 114 shows a nucleotide sequence (SEQ ID NO:180) of a native sequence PRO1135 (UNQ573) 
cDNA, wherein SEQ ID NO:180 is a clone designated herein as "DNA56870-1492". 

Figure 115 shows the amino acid sequence (SEQ ID NO:181) derived from the coding sequence of 
SEQ ID NO:180 shown in Figure 114. 

Figure 116 shows a nucleotide sequence (SEQ ID NO:182) of a native sequence PRO1114 (UNQ557) 
cDNA, wherein SEQ ID NO:182 is a clone designated herein as "DNA57033-1403". 

Figure 117 shows the amino acid sequence (SEQ ID NO:183) derived from the coding sequence of 
SEQ ID NO:182 shown in Figure 116. 

Figure 118 shows a nucleotide sequence designated herein as DNA48466 (SEQ ID NO:184). 

Figure 119 shows a nucleotide sequence (SEQ ID NO:188) of a native sequence PRO828 (UNQ469) 
cDNA, wherein SEQ ID NO:188 is a clone designated herein as "DNA57037-1444". 

Figure 120 shows the amino acid sequence (SEQ ID NO:189) derived from the coding sequence of 
SEQ ID NO:188 shown in Figure 119. 

Figure 121 shows a nucleotide sequence (SEQ ID NO:193) of a native sequence PRO1009 (UNQ493) 
cDNA, wherein SEQ ID NO:193 is a clone designated herein as "DNA57129-1413". 

Figure 122 shows the amino acid sequence (SEQ ID NO:194) derived from the coding sequence of 
SEQ ID NO:193 shown in Figure 121. 

Figure 123 shows a nucleotide sequence designated herein as DNA50853 (SEQ ID NO:195). 

Figure 124 shows a nucleotide sequence (SEQ ID NO:196) of a native sequence PRO1007 (UNQ491) 
cDNA, wherein SEQ ID NO:196 is a clone designated herein as "DNA57690-1374". 

Figure 125 shows the amino acid sequence (SEQ ID NO:197) derived from the coding sequence of 
SEQ ID NO:196 shown in Figure 124. 

Figure 126 shows a nucleotide sequence (SEQ ID NO:198) of a native sequence PRO1056 (UNQ521) 
cDNA, wherein SEQ ID NO:198 is a clone designated herein as "DNA57693-1424". 

Figure 127 shows the amino acid sequence (SEQ ID NO:199) derived from the coding sequence of 
SEQ ID NO:198 shown in Figure 126. 

Figure 128 shows a nucleotide sequence (SEQ ID NO:200) of a native sequence PRO826 (UNQ467) 
cDNA, wherein SEQ ID NO:200 is a clone designated herein as "DNA57694-1341". 

Figure 129 shows the amino acid sequence (SEQ ID NO:201) derived from the coding sequence of 
SEQ ID NO:200 shown in Figure 128. 

Figure 130 shows a nucleotide sequence (SEQ ID NO:202) of a native sequence PRO819 (UNQ466) 
cDNA, wherein SEQ ID NO:202 is a clone designated herein as "DNA57695-1340". 

Figure 131 shows the amino acid sequence (SEQ ID NO:203) derived from the coding sequence of 
SEQ ID NO:202 shown in Figure 130. 

Figure 132 shows a nucleotide sequence (SEQ ID NO:204) of a native sequence PRO1006 (UNQ490) 
cDNA, wherein SEQ ID NO:204 is a clone designated herein as "DNA57699-1412". 

Figure 133 shows the amino acid sequence (SEQ ID NO:205) derived from the coding sequence of 
SEQ ID NO:204 shown in Figure 132. 

Figure 134 shows a nucleotide sequence (SEQ ID NO:206) of a native sequence PRO1112 (UNQ555) 
cDNA, wherein SEQ ID NO:206 is a clone designated herein as "DNA57702-1476". 

Figure 135 shows the amino acid sequence (SEQ ID NO:207) derived from the coding sequence of 
SEQ ID NO:206 shown in Figure 134. 

Figure 136 shows a nucleotide sequence (SEQ ID NO:208) of a native sequence PRO1074 (UNQ531) 
cDNA, wherein SEQ ID NO:208 is a clone designated herein as "DNA57704-1452". 

Figure 137 shows the amino acid sequence (SEQ ID NO:209) derived from the coding sequence of 
SEQ ID NO:208 shown in Figure 136. 

Figure 138 shows a nucleotide sequence (SEQ ID NO:210) of a native sequence PRO1005 (UNQ489) 
cDNA, wherein SEQ ID NO:210 is a clone designated herein as "DNA57708-1005". 

Figure 139 shows the amino acid sequence (SEQ ID NO:211) derived from the coding sequence of 
SEQ ID NO:210 shown in Figure 138. 

Figure 140 shows a nucleotide sequence (SEQ ID NO:212) of a native sequence PRO1073 (UNQ530) 
cDNA, wherein SEQ ID NO:212 is a clone designated herein as "DNA57710-1451". 

Figure 141 shows the amino acid sequence (SEQ ID NO:213) derived from the coding sequence of 
SEQ ID NO:212 shown in Figure 140. 

Figure 142 shows a nucleotide sequence designated herein as DNA55938 (SEQ ID NO:214). 

Figure 143 shows a nucleotide sequence (SEQ ID NO:215) of a native sequence PRO1152 (UNQ582) 
cDNA, wherein SEQ ID NO:215 is a clone designated herein as "DNA57711-1501". 

Figure 144 shows the amino acid sequence (SEQ ID NO:216) derived from the coding sequence of 
SEQ ID NO:215 shown in Figure 143. 

Figure 145 shows a nucleotide sequence designated herein as DNA55807 (SEQ ID NO:217). 

Figure 146 shows a nucleotide sequence (SEQ ID NO:218) of a native sequence PRO1136 (UNQ574) 
cDNA, wherein SEQ ID NO:218 is a clone designated herein as "DNA57827-1493". 

Figure 147 shows the amino acid sequence (SEQ ID NO:219) derived from the coding sequence of 
SEQ ID NO:218 shown in Figure 146. 

Figure 148 shows a nucleotide sequence (SEQ ID NO:220) of a native sequence PRO813 (UNQ465) 
cDNA, wherein SEQ ID NO:220 is a clone designated herein as "DNA57834-1339". 

Figure 149 shows the amino acid sequence (SEQ ID NO:221) derived from the coding sequence of 
SEQ ID NO:220 shown in Figure 148. 

Figure 150 shows a nucleotide sequence (SEQ ID NO:222) of a native sequence PRO809 (UNQ464) 
cDNA, wherein SEQ ID NO:222 is a clone designated herein as "DNA57836-1338". 

Figure 151 shows the amino acid sequence (SEQ ID NO:223) derived from the coding sequence of 
SEQ ID NO:222 shown in Figure 150. 

Figure 152 shows a nucleotide sequence (SEQ ID NO:224) of a native sequence PRO791 (UNQ463) 
cDNA, wherein SEQ ID NO:224 is a clone designated herein as "DNA57838-1337". 

Figure 153 shows the amino acid sequence (SEQ ID NO:225) derived from the coding sequence of 
SEQ ID NO:224 shown in Figure 152. 

Figure 154 shows a nucleotide sequence (SEQ ID NO:226) of a native sequence PRO1004 (UNQ488) 
cDNA, wherein SEQ ID NO:226 is a clone designated herein as "DNA57844-1410". 

Figure 155 shows the amino acid sequence (SEQ ID NO:227) derived from the coding sequence of 
SEQ ID NO:226 shown in Figure 154. 

Figure 156 shows a nucleotide sequence (SEQ ID NO:228) of a native sequence PRO1111 (UNQ554) 
cDNA, wherein SEQ ID NO:228 is a clone designated herein as "DNA58721-1475". 

Figure 157 shows the amino acid sequence (SEQ ID NO:229) derived from the coding sequence of 
SEQ ID NO:228 shown in Figure 156. 

Figure 158 shows a nucleotide sequence (SEQ ID NO:230) of a native sequence PRO1344 (UNQ699) 
cDNA, wherein SEQ ID NO:230 is a clone designated herein as "DNA58723-1588". 

Figure 159 shows the amino acid sequence (SEQ ID NO:231) derived from the coding sequence of 
SEQ ID NO:230 shown in Figure 158. 

Figure 160 shows a nucleotide sequence (SEQ ID NO:235) of a native sequence PRO1109 (UNQ552) 
cDNA, wherein SEQ ID NO:235 is a clone designated herein as "DNA58737-1473". 

Figure 161 shows the amino acid sequence (SEQ ID NO:236) derived from the coding sequence of 
SEQ ID NO:235 shown in Figure 160. 

Figure 162 shows a nucleotide sequence (SEQ ID NO:240) of a native sequence PRO1383 (UNQ719) 
cDNA, wherein SEQ ID NO:240 is a clone designated herein as "DNA58743-1609". 

Figure 163 shows the amino acid sequence (SEQ ID NO:241) derived from the coding sequence of 
SEQ ID NO:240 shown in Figure 162. 

Figure 164 shows a nucleotide sequence (SEQ ID NO:245) of a native sequence PRO1003 (UNQ487) 
cDNA, wherein SEQ ID NO:245 is a clone designated herein as "DNA58846-1409". 

Figure 165 shows the amino acid sequence (SEQ ID NO:246) derived from the coding sequence of 
SEQ ID NO:245 shown in Figure 164. 

Figure 166 shows a nucleotide sequence (SEQ ID NO:247) of a native sequence PRO1108 (UNQ551) 
cDNA, wherein SEQ ID NO:247 is a clone designated herein as "DNA58848-1472". 

Figure 167 shows the amino acid sequence (SEQ ID NO:248) derived from the coding sequence of 
SEQ ID NO:247 shown in Figure 166. 

Figure 168 shows a nucleotide sequence (SEQ ID NO:249) of a native sequence PRO1137 (UNQ575) 
cDNA, wherein SEQ ID NO:249 is a clone designated herein as "DNA58849-1494". 

Figure 169 shows the amino acid sequence (SEQ ID NO:250) derived from the coding sequence of 
SEQ ID NO:249 shown in Figure 168. 

Figure 170 shows a nucleotide sequence (SEQ ID NO:252) of a native sequence PRO1138 (UNQ576) 
cDNA, wherein SEQ ID NO:252 is a clone designated herein as "DNA58850-1495". 

Figure 171 shows the amino acid sequence (SEQ ID NO:253) derived from the coding sequence of 
SEQ ID NO:252 shown in Figure 170. 

Figure 172 shows a nucleotide sequence designated herein as DNA49140 (SEQ ID NO:254). 

Figure 173 shows a nucleotide sequence (SEQ ID NO:255) of a native sequence PRO1054 (UNQ519) 
cDNA, wherein SEQ ID NO:255 is a clone designated herein as "DNA58853-1423". 

Figure 174 shows the amino acid sequence (SEQ ID NO:256) derived from the coding sequence of 
SEQ ID NO:255 shown in Figure 173. 

Figure 175 shows a nucleotide sequence (SEQ ID NO:257) of a native sequence PRO994 (UNQ518) 
cDNA, wherein SEQ ID NO:257 is a clone designated herein as "DNA58855-1422". 

Figure 176 shows the amino acid sequence (SEQ ID NO:258) derived from the coding sequence of 
SEQ ID NO:257 shown in Figure 175. 

Figure 177 shows a nucleotide sequence (SEQ ID NO:259) of a native sequence PRO812 (UNQ517) 
cDNA, wherein SEQ ID NO:259 is a clone designated herein as "DNA59205-1421". 

Figure 178 shows the amino acid sequence (SEQ ID NO:260) derived from the coding sequence of 
SEQ ID NO:259 shown in Figure 177. 

Figure 179 shows a nucleotide sequence (SEQ ID NO:261) of a native sequence PRO1069 (UNQ526) 
cDNA, wherein SEQ ID NO:261 is a clone designated herein as "DNA59211-1450". 

Figure 180 shows the amino acid sequence (SEQ ID NO:262) derived from the coding sequence of 
SEQ ID NO:261 shown in Figure 179. 

Figure 181 shows a nucleotide sequence (SEQ ID NO:263) of a native sequence PRO1129 (UNQ568) 
cDNA, wherein SEQ ID NO:263 is a clone designated herein as "DNA59213-1487". 

Figure 182 shows the amino acid sequence (SEQ ID NO:264) derived from the coding sequence of 
SEQ ID NO:263 shown in Figure 181. 

Figure 183 shows a nucleotide sequence (SEQ ID NO:265) of a native sequence PRO1068 (UNQ525) 
cDNA, wherein SEQ ID NO:265 is a clone designated herein as "DNA59214-1449". 

Figure 184 shows the amino acid sequence (SEQ ID NO:266) derived from the coding sequence of 
SEQ ID NO:265 shown in Figure 183. 

Figure 185 shows a nucleotide sequence (SEQ ID NO:267) of a native sequence PRO1066 (UNQ524) 
cDNA, wherein SEQ ID NO:267 is a clone designated herein as "DNA59215-1425". 

Figure 186 shows the amino acid sequence (SEQ ID NO:268) derived from the coding sequence of 
SEQ ID NO:267 shown in Figure 185. 

Figure 187 shows a nucleotide sequence (SEQ ID NO:269) of a native sequence PRO1184 (UNQ598) 
cDNA, wherein SEQ ID NO:269 is a clone designated herein as "DNA59220-1514". 

Figure 188 shows the amino acid sequence (SEQ ID NO:270) derived from the coding sequence of 
SEQ ID NO:269 shown in Figure 187. 

Figure 189 shows a nucleotide sequence (SEQ ID NO:271) of a native sequence PRO1360 (UNQ709) 
cDNA, wherein SEQ ID NO:271 is a clone designated herein as "DNA59488-1603". 

Figure 190 shows the amino acid sequence (SEQ ID NO:272) derived from the coding sequence of 
SEQ ID NO:271 shown in Figure 189. 

Figure 191 shows a nucleotide sequence (SEQ ID NO:273) of anative sequence PRO1029 (UNQ514) 
cDNA, wherein SEQ ID NO:273 is a clone designated herein as "DNA59493-1420". 

Figure 192 shows the amino acid sequence (SEQ ID NO: 274) derived from the coding sequence of 
SEQ ID NO:273 shown in Figure 191. 

Figure 193 shows a nucleotide sequence (SEQ ID NO:275) of a native sequence PROl 139 (UNQ577) 
cDNA, wherein SEQ ID NO:275 is a clone designated herein as "DNA59497-1496". 

Figure 194 shows the amino acid sequence (SEQ ID NO:276) derived from the coding sequence of 
SEQ ID NO:275 shown in Figure 193. 

Figure 195 shows a nucleotide sequence (SEQ ID NO:277) of a native sequence PRO 1309 (UNQ675) 
cDNA, wherein SEQ ID NO:277 is a clone designated herein as "DNA59588-157r. 

Figure 196 shows the amino acid sequence (SEQ ID NO:278) derived from the coding sequence of 
SEQ ID NO:277 shown in Figure 195. 

Figure 197 shows a nucleotide sequence (SEQ ID NO:280) of a native sequence PRO1028 (UNQ5 1 3) 
cDNA, wherein SEQ ID NO:280 is a clone designated herein as "DNA59603-1419". 

Figure 198 shows the amino acid sequence (SEQ ID NO:281) derived from the coding sequence of 
SEQ ID NO:280 shown in Figure 197, 

Figure 199 shows a nucleotide sequence (SEQ ID NO:282) of a native sequence PRO1027 (UNQ5 12) 
cDNA, wherein SEQ ID NO:282 is a clone designated herein as "DNA59605-1418". 

Figure 200 shows the amino acid sequence (SEQ ID NO:283) derived from die coding sequence of 
SEQ ID NO:282 shown in Figure 199. 

Figure 201 shows a nucleotide sequence (SEQ ID NO:284) of a native sequence PROl 107 (UNQ550) 
cDNA, wherein SEQ ID NO:284 is a clone designated herein as ''DNA59606-147r . 

Figure 202 shows the amino acid sequence (SEQ ID NO:285) derived from the coding sequence of 
SEQ ID NO:284 shown in Figure 201. 

Figure 203 shows a nucleotide sequence (SEQ ID NO:286) of a native sequence PROl 140 (UNQ578) 
cDNA, wherein SEQ ID NO:286 is a clone designated herein as "DNA59607-1497". 

Figure 204 shows the amino acid sequence (SEQ ID NO:287) derived from the coding sequence of 
SEQ ID NO:286 shown in Figure 203. 

Figure 205 shows a nucleotide sequence (SEQ ID NO:288) of a native sequence PROl 106 (UNQ549) 
cDNA, wherein SEQ ID NO:288 is a clone designated herein as •DNA59609-1470". 



Figure 188 shows the amino acid sequence (SEQ ID NO:270) derived from the coding sequence of 
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Figure 206 shows the amino acid sequence (SEQ ID NO:289) derived from the coding sequence of 
SEQ ID NO:288 shown in Figure 205. 



Figure 207 shows a nucleotide sequence (SEQ ID NO:290) of a native sequence PRO1291 (UNQ659)
cDNA, wherein SEQ ID NO:290 is a clone designated herein as "DNA59610-1556".

Figure 208 shows the amino acid sequence (SEQ ID NO:291) derived from the coding sequence of
SEQ ID NO:290 shown in Figure 207.

Figure 209 shows a nucleotide sequence (SEQ ID NO:292) of a native sequence PRO1105 (UNQ548)
cDNA, wherein SEQ ID NO:292 is a clone designated herein as "DNA59612-1466".

Figure 210 shows the amino acid sequence (SEQ ID NO:293) derived from the coding sequence of
SEQ ID NO:292 shown in Figure 209.

Figure 211 shows a nucleotide sequence (SEQ ID NO:294) of a native sequence PRO511 (UNQ511)
cDNA, wherein SEQ ID NO:294 is a clone designated herein as "DNA59613-1417".

Figure 212 shows the amino acid sequence (SEQ ID NO:295) derived from the coding sequence of
SEQ ID NO:294 shown in Figure 211.

Figure 213 shows a nucleotide sequence (SEQ ID NO:296) of a native sequence PRO1104 (UNQ547)
cDNA, wherein SEQ ID NO:296 is a clone designated herein as "DNA59616-1465".

Figure 214 shows the amino acid sequence (SEQ ID NO:297) derived from the coding sequence of
SEQ ID NO:296 shown in Figure 213.

Figure 215 shows a nucleotide sequence (SEQ ID NO:298) of a native sequence PRO1100 (UNQ546)
cDNA, wherein SEQ ID NO:298 is a clone designated herein as "DNA59619-1464".

Figure 216 shows the amino acid sequence (SEQ ID NO:299) derived from the coding sequence of
SEQ ID NO:298 shown in Figure 215.

Figure 217 shows a nucleotide sequence (SEQ ID NO:300) of a native sequence PRO836 (UNQ545)
cDNA, wherein SEQ ID NO:300 is a clone designated herein as "DNA59620-1463".

Figure 218 shows the amino acid sequence (SEQ ID NO:301) derived from the coding sequence of
SEQ ID NO:300 shown in Figure 217.

Figure 219 shows a nucleotide sequence (SEQ ID NO:302) of a native sequence PRO1141 (UNQ579)
cDNA, wherein SEQ ID NO:302 is a clone designated herein as "DNA59625-1498".

Figure 220 shows the amino acid sequence (SEQ ID NO:303) derived from the coding sequence of
SEQ ID NO:302 shown in Figure 219.

Figure 221 shows a nucleotide sequence designated herein as DNA33128 (SEQ ID NO:304).

Figure 222 shows a nucleotide sequence designated herein as DNA34256 (SEQ ID NO:305).

Figure 223 shows a nucleotide sequence designated herein as DNA47941 (SEQ ID NO:306).

Figure 224 shows a nucleotide sequence designated herein as DNA54389 (SEQ ID NO:307).

Figure 225 shows a nucleotide sequence (SEQ ID NO:308) of a native sequence PRO1132 (UNQ570)
cDNA, wherein SEQ ID NO:308 is a clone designated herein as "DNA59767-1489".

Figure 226 shows the amino acid sequence (SEQ ID NO:309) derived from the coding sequence of
SEQ ID NO:308 shown in Figure 225.

Figure 227 shows a nucleotide sequence (SEQ ID NO:313) of a native sequence PRO1346 cDNA,
wherein SEQ ID NO:313 is a clone designated herein as "DNA59776-1600".

Figure 228 shows the amino acid sequence (SEQ ID NO:314) derived from the coding sequence of
SEQ ID NO:313 shown in Figure 227.

Figure 229 shows a nucleotide sequence (SEQ ID NO:318) of a native sequence PRO1131 (UNQ569)
cDNA, wherein SEQ ID NO:318 is a clone designated herein as "DNA59777-1480".

Figure 230 shows the amino acid sequence (SEQ ID NO:319) derived from the coding sequence of 
SEQ ID NO:318 shown in Figure 229.

Figure 231 shows a nucleotide sequence designated herein as DNA43546 (SEQ ID NO:320).

Figure 232 shows a nucleotide sequence (SEQ ID NO:325) of a native sequence PRO1281 (UNQ651)
cDNA, wherein SEQ ID NO:325 is a clone designated herein as "DNA59820-1549". 

Figure 233 shows the amino acid sequence (SEQ ID NO:326) derived from the coding sequence of 
SEQ ID NO:325 shown in Figure 232. 

Figure 234 shows a nucleotide sequence (SEQ ID NO:333) of a native sequence PRO1064 (UNQ111)
cDNA, wherein SEQ ID NO:333 is a clone designated herein as "DNA59827-1426". 

Figure 235 shows the amino acid sequence (SEQ ID NO:334) derived from the coding sequence of 
SEQ ID NO:333 shown in Figure 234. 

Figure 236 shows a nucleotide sequence designated herein as DNA45288 (SEQ ID NO:335). 

Figure 237 shows a nucleotide sequence (SEQ ID NO:339) of a native sequence PRO1379 (UNQ716)
cDNA, wherein SEQ ID NO:339 is a clone designated herein as "DNA59828-1608".

Figure 238 shows the amino acid sequence (SEQ ID NO:340) derived from the coding sequence of 
SEQ ID NO:339 shown in Figure 237. 

Figure 239 shows a nucleotide sequence (SEQ ID NO:344) of a native sequence PRO844 (UNQ544)
cDNA, wherein SEQ ID NO:344 is a clone designated herein as "DNA59838-1462".

Figure 240 shows the amino acid sequence (SEQ ID NO:345) derived from the coding sequence of
SEQ ID NO:344 shown in Figure 239.

Figure 241 shows a nucleotide sequence (SEQ ID NO:346) of a native sequence PRO848 (UNQ543)
cDNA, wherein SEQ ID NO:346 is a clone designated herein as "DNA59839-1461".

Figure 242 shows the amino acid sequence (SEQ ID NO:347) derived from the coding sequence of
SEQ ID NO:346 shown in Figure 241. 

Figure 243 shows a nucleotide sequence (SEQ ID NO:348) of a native sequence PRO1097 (UNQ542) 
cDNA, wherein SEQ ID NO:348 is a clone designated herein as "DNA59841-1460". 

Figure 244 shows the amino acid sequence (SEQ ID NO:349) derived from the coding sequence of
SEQ ID NO:348 shown in Figure 243.

Figure 245 shows a nucleotide sequence (SEQ ID NO:350) of a native sequence PRO1153 (UNQ583)
cDNA, wherein SEQ ID NO:350 is a clone designated herein as "DNA59842-1502".

Figure 246 shows the amino acid sequence (SEQ ID NO:351) derived from the coding sequence of
SEQ ID NO:350 shown in Figure 245.

Figure 247 shows a nucleotide sequence (SEQ ID NO:352) of a native sequence PRO1154 (UNQ584)
cDNA, wherein SEQ ID NO:352 is a clone designated herein as "DNA59846-1503".

Figure 248 shows the amino acid sequence (SEQ ID NO:353) derived from the coding sequence of 
SEQ ID NO:352 shown in Figure 247. 

Figure 249 shows a nucleotide sequence (SEQ ID NO:354) of a native sequence PRO1181 (UNQ595)
cDNA, wherein SEQ ID NO:354 is a clone designated herein as "DNA59847-1511".

Figure 250 shows the amino acid sequence (SEQ ID NO:355) derived from the coding sequence of
SEQ ID NO:354 shown in Figure 249. 

Figure 251 shows a nucleotide sequence (SEQ ID NO:356) of a native sequence PRO1182 (UNQ596)
cDNA, wherein SEQ ID NO:356 is a clone designated herein as "DNA59848-1512".

Figure 252 shows the amino acid sequence (SEQ ID NO:357) derived from the coding sequence of
SEQ ID NO:356 shown in Figure 251.

Figure 253 shows a nucleotide sequence (SEQ ID NO:358) of a native sequence PRO1155 (UNQ585)
cDNA, wherein SEQ ID NO:358 is a clone designated herein as "DNA59849-1504".

Figure 254 shows the amino acid sequence (SEQ ID NO:359) derived from the coding sequence of
SEQ ID NO:358 shown in Figure 253. 

Figure 255 shows a nucleotide sequence (SEQ ID NO:360) of a native sequence PRO1156 (UNQ586)
cDNA, wherein SEQ ID NO:360 is a clone designated herein as "DNA59853-1505". 

Figure 256 shows the amino acid sequence (SEQ ID NO:361) derived from the coding sequence of 
SEQ ID NO:360 shown in Figure 255. 

Figure 257 shows a nucleotide sequence (SEQ ID NO:362) of a native sequence PRO1098 (UNQ541) 
cDNA, wherein SEQ ID NO:362 is a clone designated herein as "DNA59854-1459". 

Figure 258 shows the amino acid sequence (SEQ ID NO:363) derived from the coding sequence of 
SEQ ID NO:362 shown in Figure 257. 

Figure 259 shows a nucleotide sequence (SEQ ID NO:364) of a native sequence PRO1127 (UNQ565)
cDNA, wherein SEQ ID NO:364 is a clone designated herein as "DNA60283-1484". 

Figure 260 shows the amino acid sequence (SEQ ID NO:365) derived from the coding sequence of 
SEQ ID NO:364 shown in Figure 259. 

Figure 261 shows a nucleotide sequence (SEQ ID NO:366) of a native sequence PRO1126 (UNQ564)
cDNA, wherein SEQ ID NO:366 is a clone designated herein as "DNA60615-1483".

Figure 262 shows the amino acid sequence (SEQ ID NO:367) derived from the coding sequence of
SEQ ID NO:366 shown in Figure 261.

Figure 263 shows a nucleotide sequence (SEQ ID NO:368) of a native sequence PRO1125 (UNQ563)
cDNA, wherein SEQ ID NO:368 is a clone designated herein as "DNA60619-1482". 

Figure 264 shows the amino acid sequence (SEQ ID NO:369) derived from the coding sequence of 
SEQ ID NO:368 shown in Figure 263. 

Figure 265 shows a nucleotide sequence (SEQ ID NO:370) of a native sequence PRO1186 (UNQ600)
cDNA, wherein SEQ ID NO:370 is a clone designated herein as "DNA60621-1516".

Figure 266 shows the amino acid sequence (SEQ ID NO:371) derived from the coding sequence of
SEQ ID NO:370 shown in Figure 265.

Figure 267 shows a nucleotide sequence (SEQ ID NO:372) of a native sequence PRO1198 (UNQ611)
cDNA, wherein SEQ ID NO:372 is a clone designated herein as "DNA60622-1525". 

Figure 268 shows the amino acid sequence (SEQ ID NO:373) derived from the coding sequence of
SEQ ID NO:372 shown in Figure 267. 

Figure 269 shows a nucleotide sequence (SEQ ID NO:374) of a native sequence PRO1158 (UNQ588)
cDNA, wherein SEQ ID NO:374 is a clone designated herein as "DNA60625-1507". 

Figure 270 shows the amino acid sequence (SEQ ID NO:375) derived from the coding sequence of 
SEQ ID NO:374 shown in Figure 269. 

Figure 271 shows a nucleotide sequence (SEQ ID NO:376) of a native sequence PRO1159 (UNQ589)
cDNA, wherein SEQ ID NO:376 is a clone designated herein as "DNA60627-1508". 

Figure 272 shows the amino acid sequence (SEQ ID NO:377) derived from the coding sequence of 
SEQ ID NO:376 shown in Figure 271. 

Figure 273 shows a nucleotide sequence (SEQ ID NO:378) of a native sequence PRO1124 (UNQ562)
cDNA, wherein SEQ ID NO:378 is a clone designated herein as "DNA60629-1481".

Figure 274 shows the amino acid sequence (SEQ ID NO:379) derived from the coding sequence of 
SEQ ID NO:378 shown in Figure 273. 

Figure 275 shows a nucleotide sequence (SEQ ID NO:380) of a native sequence PRO1287 (UNQ656)
cDNA, wherein SEQ ID NO:380 is a clone designated herein as "DNA61755-1554".

Figure 276 shows the amino acid sequence (SEQ ID NO:381) derived from the coding sequence of 
SEQ ID NO:380 shown in Figure 275. 

Figure 277 shows a nucleotide sequence (SEQ ID NO:386) of a native sequence PRO1312 (UNQ678)
cDNA, wherein SEQ ID NO:386 is a clone designated herein as "DNA61873-1574".

Figure 278 shows the amino acid sequence (SEQ ID NO:387) derived from the coding sequence of
SEQ ID NO:386 shown in Figure 277. 

Figure 279 shows a nucleotide sequence (SEQ ID NO:388) of a native sequence PRO1192 (UNQ606)
cDNA, wherein SEQ ID NO:388 is a clone designated herein as "DNA62814-1521".

Figure 280 shows the amino acid sequence (SEQ ID NO:389) derived from the coding sequence of
SEQ ID NO:388 shown in Figure 279. 

Figure 281 shows a nucleotide sequence (SEQ ID NO:393) of a native sequence PRO1160 (UNQ590)
cDNA, wherein SEQ ID NO:393 is a clone designated herein as "DNA62872-1509".

Figure 282 shows the amino acid sequence (SEQ ID NO:394) derived from the coding sequence of
SEQ ID NO:393 shown in Figure 281.

Figure 283 shows a nucleotide sequence (SEQ ID NO:398) of a native sequence PRO1187 (UNQ601)
cDNA, wherein SEQ ID NO:398 is a clone designated herein as "DNA62876-1517". 

Figure 284 shows the amino acid sequence (SEQ ID NO:399) derived from the coding sequence of
SEQ ID NO:398 shown in Figure 283.

Figure 285 shows a nucleotide sequence (SEQ ID NO:400) of a native sequence PRO1185 (UNQ599)
cDNA, wherein SEQ ID NO:400 is a clone designated herein as "DNA62881-1515".

Figure 286 shows the amino acid sequence (SEQ ID NO:401) derived from the coding sequence of 
SEQ ID NO:400 shown in Figure 285. 

Figure 287 shows a nucleotide sequence (SEQ ID NO:402) of a native sequence PRO1345 (UNQ700)
cDNA, wherein SEQ ID NO:402 is a clone designated herein as "DNA64852-1589". 

Figure 288 shows the amino acid sequence (SEQ ID NO:403) derived from the coding sequence of 
SEQ ID NO:402 shown in Figure 287. 

Figure 289 shows a nucleotide sequence (SEQ ID NO:407) of a native sequence PRO1245 (UNQ629)
cDNA, wherein SEQ ID NO:407 is a clone designated herein as "DNA64884-1527". 

Figure 290 shows the amino acid sequence (SEQ ID NO:408) derived from the coding sequence of 
SEQ ID NO:407 shown in Figure 289. 

Figure 291 shows a nucleotide sequence (SEQ ID NO:409) of a native sequence PRO1358 (UNQ707)
cDNA, wherein SEQ ID NO:409 is a clone designated herein as "DNA64890-1612". 

Figure 292 shows the amino acid sequence (SEQ ID NO:410) derived from the coding sequence of 
SEQ ID NO:409 shown in Figure 291. 

Figure 293 shows a nucleotide sequence (SEQ ID NO:411) of a native sequence PRO1195 (UNQ608)
cDNA, wherein SEQ ID NO:411 is a clone designated herein as "DNA65412-1523".

Figure 294 shows the amino acid sequence (SEQ ID NO:412) derived from the coding sequence of
SEQ ID NO:411 shown in Figure 293.

Figure 295 shows a nucleotide sequence (SEQ ID NO:413) of a native sequence PRO1270 (UNQ640)
cDNA, wherein SEQ ID NO:413 is a clone designated herein as "DNA66308-1537".

Figure 296 shows the amino acid sequence (SEQ ID NO:414) derived from the coding sequence of
SEQ ID NO:413 shown in Figure 295.

Figure 297 shows a nucleotide sequence (SEQ ID NO:415) of a native sequence PRO1271 (UNQ641)
cDNA, wherein SEQ ID NO:415 is a clone designated herein as "DNA66309-1538".

Figure 298 shows the amino acid sequence (SEQ ID NO:416) derived from the coding sequence of
SEQ ID NO:415 shown in Figure 297.

Figure 299 shows a nucleotide sequence (SEQ ID NO:417) of a native sequence PRO1375 (UNQ712)
cDNA, wherein SEQ ID NO:417 is a clone designated herein as "DNA67004-1614".

Figure 300 shows the amino acid sequence (SEQ ID NO:418) derived from the coding sequence of 
SEQ ID NO:417 shown in Figure 299. 

Figure 301 shows a nucleotide sequence (SEQ ID NO:419) of a native sequence PRO1385 (UNQ720)
cDNA, wherein SEQ ID NO:419 is a clone designated herein as "DNA68869-1610". 

Figure 302 shows the amino acid sequence (SEQ ID NO:420) derived from the coding sequence of 
SEQ ID NO:419 shown in Figure 301. 

Figure 303 shows a nucleotide sequence (SEQ ID NO:421) of a native sequence PRO1387 (UNQ722)
cDNA, wherein SEQ ID NO:421 is a clone designated herein as "DNA68872-1620".

Figure 304 shows the amino acid sequence (SEQ ID NO:422) derived from the coding sequence of
SEQ ID NO:421 shown in Figure 303.

Figure 305 shows a nucleotide sequence (SEQ ID NO:423) of a native sequence PRO1384 (UNQ721)
cDNA, wherein SEQ ID NO:423 is a clone designated herein as "DNA71159-1617".

Figure 306 shows the amino acid sequence (SEQ ID NO:424) derived from the coding sequence of
SEQ ID NO:423 shown in Figure 305. 

Figure 307 shows a nucleotide sequence (SEQ ID NO:494) of a native sequence PRO183 cDNA,
wherein SEQ ID NO:494 is a clone designated herein as "DNA28498". 

Figure 308 shows the amino acid sequence (SEQ ID NO:495) derived from the coding sequence of 
SEQ ID NO:494 shown in Figure 307. 

Figure 309 shows a nucleotide sequence (SEQ ID NO:496) of a native sequence PRO184 cDNA,
wherein SEQ ID NO:496 is a clone designated herein as "DNA28500".

Figure 310 shows the amino acid sequence (SEQ ID NO:497) derived from the coding sequence of 
SEQ ID NO:496 shown in Figure 309. 

Figure 311 shows a nucleotide sequence (SEQ ID NO:498) of a native sequence PRO185 cDNA,
wherein SEQ ID NO:498 is a clone designated herein as "DNA28503".

Figure 312 shows the amino acid sequence (SEQ ID NO:499) derived from the coding sequence of
SEQ ID NO:498 shown in Figure 311.

Figure 313 shows a nucleotide sequence (SEQ ID NO:500) of a native sequence PRO331 cDNA,
wherein SEQ ID NO:500 is a clone designated herein as "DNA40981-1234". 

Figure 314 shows the amino acid sequence (SEQ ID NO:501) derived from the coding sequence of 
SEQ ID NO:500 shown in Figure 313. 

Figure 315 shows a nucleotide sequence (SEQ ID NO:502) of a native sequence PRO363 cDNA,
wherein SEQ ID NO:502 is a clone designated herein as "DNA45419-1252". 

Figure 316 shows the amino acid sequence (SEQ ID NO:503) derived from the coding sequence of 
SEQ ID NO:502 shown in Figure 315. 

Figure 317 shows a nucleotide sequence (SEQ ID NO:504) of a native sequence PRO5723 cDNA,
wherein SEQ ID NO:504 is a clone designated herein as "DNA82361".

Figure 318 shows the amino acid sequence (SEQ ID NO:505) derived from the coding sequence of
SEQ ID NO:504 shown in Figure 317. 

Figure 319 shows a nucleotide sequence (SEQ ID NO:506) of a native sequence PRO3301 cDNA, 
wherein SEQ ID NO:506 is a clone designated herein as "DNA88002". 

Figure 320 shows the amino acid sequence (SEQ ID NO:507) derived from the coding sequence of
SEQ ID NO:506 shown in Figure 319.

Figure 321 shows a nucleotide sequence (SEQ ID NO:508) of a native sequence PRO9940 cDNA,
wherein SEQ ID NO:508 is a clone designated herein as "DNA92282". 

Figure 322 shows the amino acid sequence (SEQ ID NO:509) derived from the coding sequence of 
SEQ ID NO:508 shown in Figure 321. 



WO 00/73454    PCT/US00/08439



Figure 323 shows a nucleotide sequence (SEQ ID NO:510) of a native sequence PRO9828 cDNA,
wherein SEQ ID NO:510 is a clone designated herein as "DNA142238-2768".

Figure 324 shows the amino acid sequence (SEQ ID NO:511) derived from the coding sequence of
SEQ ID NO:510 shown in Figure 323.

Figure 325 shows a nucleotide sequence (SEQ ID NO:512) of a native sequence PRO7170 cDNA, 
wherein SEQ ID NO:512 is a clone designated herein as "DNA108722-2743".

Figure 326 shows the amino acid sequence (SEQ ID NO:513) derived from the coding sequence of 
SEQ ID NO:512 shown in Figure 325. 

Figure 327 shows a nucleotide sequence (SEQ ID NO:514) of a native sequence PRO361 cDNA,
wherein SEQ ID NO:514 is a clone designated herein as "DNA45410-1250". 

Figure 328 shows the amino acid sequence (SEQ ID NO:515) derived from the coding sequence of 
SEQ ID NO:514 shown in Figure 327. 

Figure 329 shows a nucleotide sequence (SEQ ID NO:516) of a native sequence PRO846 cDNA,
wherein SEQ ID NO:516 is a clone designated herein as "DNA44196-1353".

Figure 330 shows the amino acid sequence (SEQ ID NO:517) derived from the coding sequence of 
SEQ ID NO:516 shown in Figure 329. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
I. Definitions

The terms "PRO polypeptide" and "PRO" as used herein and when immediately followed by a
numerical designation refer to various polypeptides, wherein the complete designation (i.e., PRO/number)
refers to specific polypeptide sequences as described herein. The terms "PRO/number polypeptide" and
"PRO/number" wherein the term "number" is provided as an actual numerical designation as used herein
encompass native sequence polypeptides and polypeptide variants (which are further defined herein). The PRO
polypeptides described herein may be isolated from a variety of sources, such as from human tissue types or
from another source, or prepared by recombinant or synthetic methods.

A "native sequence PRO polypeptide" comprises a polypeptide having the same amino acid sequence
as the corresponding PRO polypeptide derived from nature. Such native sequence PRO polypeptides can be
isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence PRO
polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific PRO
polypeptide (e.g., an extracellular domain sequence), naturally-occurring variant forms (e.g., alternatively
spliced forms) and naturally-occurring allelic variants of the polypeptide. In various embodiments of the
invention, the native sequence PRO polypeptides disclosed herein are mature or full-length native sequence
polypeptides comprising the full-length amino acid sequences shown in the accompanying figures. Start and
stop codons are shown in bold font and underlined in the figures. However, while the PRO polypeptides
disclosed in the accompanying figures are shown to begin with methionine residues designated herein as amino
acid position 1 in the figures, it is conceivable and possible that other methionine residues located either
upstream or downstream from the amino acid position 1 in the figures may be employed as the starting amino
acid residue for the PRO polypeptides.

The PRO polypeptide "extracellular domain" or "ECD" refers to a form of the PRO polypeptide 
which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide ECD 
will have less than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have less than
0.5% of such domains. It will be understood that any transmembrane domains identified for the PRO 
polypeptides of the present invention are identified pursuant to criteria routinely employed in the art for 
identifying that type of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but 
most likely by no more than about 5 amino acids at either end of the domain as initially identified herein. 
Optionally, therefore, an extracellular domain of a PRO polypeptide may contain from about 5 or fewer amino 
acids on either side of the transmembrane domain/extracellular domain boundary as identified in the Examples
or specification and such polypeptides, with or without the associated signal peptide, and nucleic acid encoding
them, are contemplated by the present invention.

The approximate locations of the "signal peptides" of the various PRO polypeptides disclosed herein
are shown in the present specification and/or the accompanying figures. It is noted, however, that the
C-terminal boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on either
side of the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary
of the signal peptide may be identified pursuant to criteria routinely employed in the art for identifying that type
of amino acid sequence element (e.g., Nielsen et al., Prot. Eng. 10:1-6 (1997) and von Heijne et al., Nucl.
Acids Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal
sequence from a secreted polypeptide is not entirely uniform, resulting in more than one secreted species.
These mature polypeptides, where the signal peptide is cleaved within no more than about 5 amino acids on 
either side of the C-terminal boundary of the signal peptide as identified herein, and the polynucleotides 
encoding them, are contemplated by the present invention. 
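
The range of mature species described above can be enumerated mechanically. The following Python sketch is illustrative only: the function name, the 40-residue precursor string, and the predicted signal peptide length of 20 are all hypothetical, and the default window of 5 residues simply mirrors the "about 5 amino acids" language of the text rather than a fixed rule.

```python
def candidate_mature_forms(precursor: str, signal_length: int, window: int = 5) -> dict:
    """Map each candidate cleavage position to the resulting mature sequence.

    A cleavage position n means the first n residues are removed as the
    signal peptide. Positions within `window` residues on either side of
    the predicted boundary are considered.
    """
    if not 0 < signal_length < len(precursor):
        raise ValueError("signal_length must fall inside the precursor")
    first = max(1, signal_length - window)
    last = min(len(precursor) - 1, signal_length + window)
    return {n: precursor[n:] for n in range(first, last + 1)}


# Hypothetical precursor (20-residue signal region + 20 further residues);
# it is not taken from any figure of the specification.
precursor = "MKLLLLALLLSLAALGAAQA" + "DTPEKRYSQVGHCNWTFILM"
forms = candidate_mature_forms(precursor, signal_length=20)

for n, mature in forms.items():
    print(n, mature)
```

With a predicted boundary after residue 20, the sketch lists the eleven candidate mature sequences obtained by cleaving after residues 15 through 25.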

"PRO polypeptide variant" means an active PRO polypeptide as defined above or below having at least 
about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence as 
disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular 
domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any other fragment 
of a full-length PRO polypeptide sequence as disclosed herein. Such PRO polypeptide variants include, for 
instance, PRO polypeptides wherein one or more amino acid residues are added, or deleted, at the N- or
C-terminus of the full-length native amino acid sequence. Ordinarily, a PRO polypeptide variant will have at
least about 80% amino acid sequence identity, preferably at least about 81% amino acid sequence identity,
more preferably at least about 82% amino acid sequence identity, more preferably at least about 83% amino
acid sequence identity, more preferably at least about 84% amino acid sequence identity, more preferably at
least about 85% amino acid sequence identity, more preferably at least about 86% amino acid sequence
identity, more preferably at least about 87% amino acid sequence identity, more preferably at least about 88%
amino acid sequence identity, more preferably at least about 89% amino acid sequence identity, more
preferably at least about 90% amino acid sequence identity, more preferably at least about 91% amino acid
sequence identity, more preferably at least about 92% amino acid sequence identity, more preferably at least
about 93% amino acid sequence identity, more preferably at least about 94% amino acid sequence identity,
more preferably at least about 95% amino acid sequence identity, more preferably at least about 96% amino
acid sequence identity, more preferably at least about 97% amino acid sequence identity, more preferably at
least about 98% amino acid sequence identity and most preferably at least about 99% amino acid sequence
identity with a full-length native sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide
sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with
or without the signal peptide, as disclosed herein or any other specifically defined fragment of a full-length
PRO polypeptide sequence as disclosed herein. Ordinarily, PRO variant polypeptides are at least about 10
amino acids in length, often at least about 20 amino acids in length, more often at least about 30 amino acids
in length, more often at least about 40 amino acids in length, more often at least about 50 amino acids in
length, more often at least about 60 amino acids in length, more often at least about 70 amino acids in length,
more often at least about 80 amino acids in length, more often at least about 90 amino acids in length, more
often at least about 100 amino acids in length, more often at least about 150 amino acids in length, more often
at least about 200 amino acids in length, more often at least about 300 amino acids in length, or more.

"Percent (%) amino acid sequence identity" with respect to the PRO polypeptide sequences identified
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the
amino acid residues in the specific PRO polypeptide sequence, after aligning the sequences and introducing
gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative
substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid
sequence identity can be achieved in various ways that are within the skill in the art, for instance, using
publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software.
Those skilled in the art can determine appropriate parameters for measuring alignment, including any
algorithms needed to achieve maximal alignment over the full length of the sequences being compared. For
purposes herein, however, % amino acid sequence identity values are generated using the sequence comparison
computer program ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided in
Table 1 below. The ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and
the source code shown in Table 1 below has been filed with user documentation in the U.S. Copyright Office,
Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. TXU510087. The
ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, California or may be
compiled from the source code provided in Table 1 below. The ALIGN-2 program should be compiled for
use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison parameters are
set by the ALIGN-2 program and do not vary.

In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which
can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid
sequence identity to, with, or against a given amino acid sequence B) is calculated as follows:

100 times the fraction X/Y

where X is the number of amino acid residues scored as identical matches by the sequence alignment program
ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in
B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino
acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence
identity of B to A. As examples of % amino acid sequence identity calculations using this method, Tables 2
and 3 demonstrate how to calculate the % amino acid sequence identity of the amino acid sequence designated
"Comparison Protein" to the amino acid sequence designated "PRO", wherein "PRO" represents the amino
acid sequence of a hypothetical PRO polypeptide of interest, "Comparison Protein" represents the amino acid
sequence of a polypeptide against which the "PRO" polypeptide of interest is being compared, and "X", "Y"
and "Z" each represent different hypothetical amino acid residues.

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are
obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program.
However, % amino acid sequence identity values may also be obtained as described below by using the
WU-BLAST-2 computer program (Altschul et al., Methods in Enzymology 266:460-480 (1996)). Most of the
WU-BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the adjustable
parameters, are set with the following values: overlap span = 1, overlap fraction = 0.125, word threshold
(T) = 11, and scoring matrix = BLOSUM62. When WU-BLAST-2 is employed, a % amino acid sequence
identity value is determined by dividing (a) the number of matching identical amino acid residues between the
amino acid sequence of the PRO polypeptide of interest having a sequence derived from the native PRO
polypeptide and the comparison amino acid sequence of interest (i.e., the sequence against which the PRO
polypeptide of interest is being compared which may be a PRO variant polypeptide) as determined by
WU-BLAST-2 by (b) the total number of amino acid residues of the PRO polypeptide of interest. For example,
in the statement "a polypeptide comprising an amino acid sequence A which has or having at least 80%
amino acid sequence identity to the amino acid sequence B", the amino acid sequence A is the comparison
amino acid sequence of interest and the amino acid sequence B is the amino acid sequence of the PRO
polypeptide of interest.

Percent amino acid sequence identity may also be determined using the sequence comparison program
NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence
comparison program may be downloaded from http://www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several
search parameters, wherein all of those search parameters are set to default values including, for example,
unmask = yes, strand = all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass
e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix
= BLOSUM62.

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % amino
acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B
(which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino
acid sequence identity to, with, or against a given amino acid sequence B) is calculated as follows:

100 times the fraction X/Y

where X is the number of amino acid residues scored as identical matches by the sequence alignment program
NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid
residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length
of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid
sequence identity of B to A.
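
The 100 times X/Y calculation stated above for ALIGN-2 and NCBI-BLAST2 can be illustrated with a minimal Python sketch. It does not perform an alignment; it assumes the number of identical matches X has already been reported by the alignment program. The function name `percent_identity` and the numeric values are hypothetical and chosen only to show why the result depends on which sequence supplies the denominator.

```python
def percent_identity(identical_matches: int, length_b: int) -> float:
    """Return 100 * X / Y for X identical matches and Y residues in sequence B."""
    if length_b <= 0:
        raise ValueError("sequence B must contain at least one residue")
    if not 0 <= identical_matches <= length_b:
        raise ValueError("X must lie between 0 and the length of sequence B")
    return 100.0 * identical_matches / length_b


# Hypothetical values: the aligner reports X = 5 identical matches between
# a 12-residue sequence A and a 15-residue sequence B.
x, len_a, len_b = 5, 12, 15
a_to_b = percent_identity(x, len_b)  # denominator is the length of B
b_to_a = percent_identity(x, len_a)  # roles swapped: denominator is the length of A
print(f"A to B: {a_to_b:.1f}%")
print(f"B to A: {b_to_a:.1f}%")
```

With these assumed numbers the identity of A to B is about 33.3% while the identity of B to A is about 41.7%, which is the asymmetry noted above for sequences of unequal length.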

"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid molecule 
which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid 
sequence identity with a nucleic acid sequence encoding a full-length native sequence PRO polypeptide
sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide 
as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, as 
disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed herein. 
Ordinarily, a PRO variant polynucleotide will have at least about 80% nucleic acid sequence identity, more 
preferably at least about 81% nucleic acid sequence identity, more preferably at least about 82% nucleic acid
sequence identity, more preferably at least about 83% nucleic acid sequence identity, more preferably at least 
about 84% nucleic acid sequence identity, more preferably at least about 85% nucleic acid sequence identity, 
more preferably at least about 86% nucleic acid sequence identity, more preferably at least about 87% nucleic 
acid sequence identity, more preferably at least about 88% nucleic acid sequence identity, more preferably at 
least about 89% nucleic acid sequence identity, more preferably at least about 90% nucleic acid sequence 
identity, more preferably at least about 91% nucleic acid sequence identity, more preferably at least about 92%
nucleic acid sequence identity, more preferably at least about 93% nucleic acid sequence identity, more 
preferably at least about 94% nucleic acid sequence identity, more preferably at least about 95% nucleic acid
sequence identity, more preferably at least about 96% nucleic acid sequence identity, more preferably at least 
about 97% nucleic acid sequence identity, more preferably at least about 98% nucleic acid sequence identity 
and yet more preferably at least about 99% nucleic acid sequence identity with a nucleic acid sequence 
encoding a full-length native sequence PRO polypeptide sequence as disclosed herein, a full-length native
sequence PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of
a PRO polypeptide, with or without the signal sequence, as disclosed herein or any other fragment of a
full-length PRO polypeptide sequence as disclosed herein. Variants do not encompass the native nucleotide
sequence. 

Ordinarily, PRO variant polynucleotides are at least about 30 nucleotides in length, often at least about
60 nucleotides in length, more often at least about 90 nucleotides in length, more often at least about 120 
nucleotides in length, more often at least about 150 nucleotides in length, more often at least about 180 
nucleotides in length, more often at least about 210 nucleotides in length, more often at least about 240 
nucleotides in length, more often at least about 270 nucleotides in length, more often at least about 300 
nucleotides in length, more often at least about 450 nucleotides in length, more often at least about 600 
nucleotides in length, more often at least about 900 nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences
identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the
nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing gaps,
if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining
percent nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for
instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign
(DNASTAR) software. For purposes herein, however, % nucleic acid sequence identity values are generated
using the sequence comparison computer program ALIGN-2, wherein the complete source code for the
ALIGN-2 program is provided in Table 1 below. The ALIGN-2 sequence comparison computer program was
authored by Genentech, Inc. and the source code shown in Table 1 below has been filed with user
documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S.
Copyright Registration No. TXU510087. The ALIGN-2 program is publicly available through Genentech,
Inc., South San Francisco, California or may be compiled from the source code provided in Table 1 below.
The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX
V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary.

In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid
sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D
(which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain %
nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows:

100 times the fraction W/Z

where W is the number of nucleotides scored as identical matches by the sequence alignment program
ALIGN-2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be
appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to
C. As examples of % nucleic acid sequence identity calculations, Tables 4 and 5 demonstrate how to calculate
the % nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic
acid sequence designated "PRO-DNA", wherein "PRO-DNA" represents a hypothetical PRO-encoding nucleic
acid sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule
against which the "PRO-DNA" nucleic acid molecule of interest is being compared, and "N", "L" and "V"
each represent different hypothetical nucleotides.

Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are
obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program.
However, % nucleic acid sequence identity values may also be obtained as described below by using the
WU-BLAST-2 computer program (Altschul et al., Methods in Enzymology 266:460-480 (1996)). Most of the
WU-BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the adjustable
parameters, are set with the following values: overlap span = 1, overlap fraction = 0.125, word threshold
(T) = 11, and scoring matrix = BLOSUM62. When WU-BLAST-2 is employed, a % nucleic acid sequence
identity value is determined by dividing (a) the number of matching identical nucleotides between the nucleic
acid sequence of the PRO polypeptide-encoding nucleic acid molecule of interest having a sequence derived
from the native sequence PRO polypeptide-encoding nucleic acid and the comparison nucleic acid molecule
of interest (i.e., the sequence against which the PRO polypeptide-encoding nucleic acid molecule of interest
is being compared which may be a variant PRO polynucleotide) as determined by WU-BLAST-2 by (b) the
total number of nucleotides of the PRO polypeptide-encoding nucleic acid molecule of interest. For example,
in the statement "an isolated nucleic acid molecule comprising a nucleic acid sequence A which has or having
at least 80% nucleic acid sequence identity to the nucleic acid sequence B", the nucleic acid sequence A is the
comparison nucleic acid molecule of interest and the nucleic acid sequence B is the nucleic acid sequence of
the PRO polypeptide-encoding nucleic acid molecule of interest.

Percent nucleic acid sequence identity may also be determined using the sequence comparison program
NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence
comparison program may be downloaded from http://www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several
search parameters, wherein all of those search parameters are set to default values including, for example,
unmask = yes, strand = all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass
e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix
= BLOSUM62.

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid
sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D
(which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain %
nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows:

100 times the fraction W/Z

where W is the number of nucleotides scored as identical matches by the sequence alignment program
NCBI-BLAST2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will
be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid
sequence D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity
of D to C.
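
As an illustration of how W and Z in the fraction W/Z relate to an alignment, the following Python sketch counts identical columns in an alignment that is already given as two gapped strings of equal length. This is only a sketch under stated assumptions: it does not reproduce ALIGN-2 or NCBI-BLAST2, the alignment is written by hand, the gap character "-" and the function name `nucleic_acid_identity` are hypothetical, and the sequences are invented.

```python
def nucleic_acid_identity(aligned_c: str, aligned_d: str, gap: str = "-") -> float:
    """Return 100 * W / Z for a pre-computed pairwise alignment of C and D.

    W counts alignment columns where C and D carry the same nucleotide.
    Z is the total number of nucleotides in D (gap characters excluded).
    """
    if len(aligned_c) != len(aligned_d):
        raise ValueError("aligned sequences must have the same number of columns")
    w = sum(1 for c, d in zip(aligned_c, aligned_d) if c == d and c != gap)
    z = sum(1 for d in aligned_d if d != gap)
    if z == 0:
        raise ValueError("sequence D must contain at least one nucleotide")
    return 100.0 * w / z


# Hypothetical hand-made alignment: C has 8 nucleotides, D has 10.
aligned_c = "ACGT--ACGT"
aligned_d = "ACGTTTACCT"
print(nucleic_acid_identity(aligned_c, aligned_d))  # C to D, denominator is len(D)
print(nucleic_acid_identity(aligned_d, aligned_c))  # D to C, denominator is len(C)
```

Here seven columns match, so the identity of C to D is 70.0% (7 of 10 nucleotides in D) while the identity of D to C is 87.5% (7 of 8 nucleotides in C).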

In other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an active
PRO polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash
conditions, to nucleotide sequences encoding a full-length PRO polypeptide as disclosed herein. PRO variant
polypeptides may be those that are encoded by a PRO variant polynucleotide.

The term "positives", in the context of sequence comparison performed as described above, includes
residues in the sequences compared that are not identical but have similar properties (e.g. as a result of
conservative substitutions, see Table 6 below). For purposes herein, the % value of positives is determined
by dividing (a) the number of amino acid residues scoring a positive value between the PRO polypeptide amino
acid sequence of interest having a sequence derived from the native PRO polypeptide sequence and the
comparison amino acid sequence of interest (i.e., the amino acid sequence against which the PRO polypeptide
sequence is being compared) as determined in the BLOSUM62 matrix of WU-BLAST-2 by (b) the total
number of amino acid residues of the PRO polypeptide of interest.

Unless specifically stated otherwise, the % value of positives is calculated as described in the
immediately preceding paragraph. However, in the context of the amino acid sequence identity comparisons
performed as described for ALIGN-2 and NCBI-BLAST2 above, the term "positives" includes amino acid
residues in the sequences compared that are not only identical, but also those that have similar properties. Amino acid
residues that score a positive value to an amino acid residue of interest are those that are either identical to the
amino acid residue of interest or are a preferred substitution (as defined in Table 6 below) of the amino acid
residue of interest.

For amino acid sequence comparisons using ALIGN-2 or NCBI-BLAST2, the % value of positives
of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can alternatively
be phrased as a given amino acid sequence A that has or comprises a certain % positives to, with, or against
a given amino acid sequence B) is calculated as follows:

100 times the fraction X/Y

where X is the number of amino acid residues scoring a positive value as defined above by the sequence
alignment program ALIGN-2 or NCBI-BLAST2 in that program's alignment of A and B, and where Y is the
total number of amino acid residues in B. It will be appreciated that where the length of amino acid sequence
A is not equal to the length of amino acid sequence B, the % positives of A to B will not equal the % positives
of B to A.
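
The % positives calculation can be sketched in Python in the same way as % identity, with one change: a column counts toward X when the two residues are identical or when the residue in A is a preferred substitution of the residue in B. The sketch below rests on several assumptions. The substitution table `ILLUSTRATIVE_PREFERRED` is a small invented example and is not Table 6, the direction in which the table is consulted is one possible reading of the definition, the alignment is supplied by hand rather than produced by ALIGN-2 or NCBI-BLAST2, and all names are hypothetical.

```python
from typing import Dict, Set

# Invented example table: residue of interest -> residues treated as positives.
# This is NOT Table 6 of the specification.
ILLUSTRATIVE_PREFERRED: Dict[str, Set[str]] = {
    "R": {"K"}, "K": {"R"},
    "I": {"L"}, "L": {"I"},
    "D": {"E"}, "E": {"D"},
}


def percent_positives(aligned_a: str, aligned_b: str,
                      preferred: Dict[str, Set[str]], gap: str = "-") -> float:
    """Return 100 * X / Y, where X counts positive columns and Y is len(B)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same number of columns")
    x = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if gap in (res_a, res_b):
            continue  # a gap never scores as a positive
        if res_a == res_b or res_a in preferred.get(res_b, set()):
            x += 1
    y = sum(1 for res_b in aligned_b if res_b != gap)
    if y == 0:
        raise ValueError("sequence B must contain at least one residue")
    return 100.0 * x / y


# Hypothetical alignment: four identical columns, two substitutions, one gap.
print(round(percent_positives("MKLV-DE", "MRIVADD", ILLUSTRATIVE_PREFERRED), 1))
```

In this invented case six of the seven residues in B are matched by an identical residue or an assumed preferred substitution, so the value is roughly 85.7%, whereas the % identity for the same alignment would count only the four identical columns.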

"Isolated," when used to describe the various polypeptides disclosed herein, means polypeptide that 
has been identified and separated and/or recovered from a component of its natural environment. Contaminant 
components of its natural environment are materials that would typically interfere with diagnostic or therapeutic 
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous 
solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at least 
15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, or (2) to
homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, preferably, 
silver stain. Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least one 
component of the PRO polypeptide natural environment will not be present. Ordinarily, however, isolated
polypeptide will be prepared by at least one purification step. 

An "isolated" PRO polypeptide-encoding nucleic acid or other polypeptide-encoding nucleic acid is 
a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with 
which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An isolated 
polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found in nature. 
Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the specific
polypeptide-encoding nucleic acid molecule as it exists in natural cells. However, an isolated polypeptide-encoding nucleic
acid molecule includes polypeptide-encoding nucleic acid molecules contained in cells that ordinarily express
the polypeptide where, for example, the nucleic acid molecule is in a chromosomal location different from that 
of natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes,
for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic 
cells are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter
or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally,
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is
accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide
adaptors or linkers are used in accordance with conventional practice.

The term "antibody" is used in the broadest sense and specifically covers, for example, single
anti-PRO monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PRO antibody
compositions with polyepitopic specificity, single chain anti-PRO antibodies, and fragments of anti-PRO
antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained from
a population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the population
are identical except for possible naturally-occurring mutations that may be present in minor amounts.

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration.
In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower
temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when
complementary strands are present in an environment below their melting temperature. The higher the degree
of desired homology between the probe and hybridizable sequence, the higher the relative temperature which
can be used. As a result, it follows that higher relative temperatures would tend to make the reaction
conditions more stringent, while lower temperatures less so. For additional details and explanation of
stringency of hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley
Interscience Publishers, (1995).

"Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those 
that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a
denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum
albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM
sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl,
0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's
solution, sonicated salmon sperm DNA (50 µg/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with washes
at 42°C in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55°C, followed by a
high-stringency wash consisting of 0.1 x SSC containing EDTA at 55°C.
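
The SSC multiples quoted in the conditions above can be converted to molar concentrations with simple scaling. The following Python sketch takes the stated composition of 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate) as its only input and assumes that SSC concentrations scale linearly with the fold value; the function name `ssc_molarity` is hypothetical.

```python
from typing import Tuple

# 1 x SSC derived from the stated 5 x SSC composition (0.75 M NaCl, 0.075 M citrate).
NACL_M_AT_1X = 0.75 / 5
CITRATE_M_AT_1X = 0.075 / 5


def ssc_molarity(fold: float) -> Tuple[float, float]:
    """Return (NaCl molarity, sodium citrate molarity) for a given SSC fold value."""
    if fold < 0:
        raise ValueError("fold must be non-negative")
    return fold * NACL_M_AT_1X, fold * CITRATE_M_AT_1X


# Fold values that appear in the stringent conditions above.
for fold in (5, 0.2, 0.1):
    nacl, citrate = ssc_molarity(fold)
    print(f"{fold} x SSC: {nacl:.4f} M NaCl, {citrate:.5f} M sodium citrate")
```

Under this scaling, 0.1 x SSC corresponds to 0.015 M sodium chloride and 0.0015 M sodium citrate, which matches the salt concentrations given for the wash in condition (1).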

"Moderately stringent conditions" may be identified as described by Sambrook et al.. Molecular 
Cloning: A Laboratory Manual . New York: Cold Spring Harbor Press, 1989, and include the use of washing 
solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 
described above. An example of moderately stringent conditions is overnight incubation at 37**C in a solution 
comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5 X Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, 
followed by washing the filters in 1 x SSC at about 37-50*C. The skilled artisan will recognize how to adjust 
the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope 
against which an antibody can be made, yet is short enough such that it does not interfere with activity of the 
polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does 
not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino 
acid residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 
amino acid residues). 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 
constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD 
or IgM. 

"Active" or "activity" for the purposes herein refers to form(s) of a PRO polypeptide which retain 
a biological and/or an immunological activity of native or naturally-occurring PRO, wherein "biological"
activity refers to a biological function (either inhibitory or stimulatory) caused by a native or
naturally-occurring PRO other than the ability to induce the production of an antibody against an antigenic epitope
possessed by a native or naturally-occurring PRO and an "immunological" activity refers to the ability to
induce the production of an antibody against an antigenic epitope possessed by a native or naturally-occurring
PRO.

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully
blocks, inhibits, or neutralizes a biological activity of a native PRO polypeptide disclosed herein. In a similar
manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological
activity of a native PRO polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically
include agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of
native PRO polypeptides, peptides, antisense oligonucleotides, small organic molecules, etc. Methods for
identifying agonists or antagonists of a PRO polypeptide may comprise contacting a PRO polypeptide with a
candidate agonist or antagonist molecule and measuring a detectable change in one or more biological activities
normally associated with the PRO polypeptide.

"Treatment" refers to bodi therapeutic treatment and prophylactic or preventative measures, wherein 
the object is to prevent or slow down (lessen) die targeted pathologic condition or disorder. Those in need of 
treatment include those already with the disorder as well as those prone to have the disorder or those in whom 
10 the disorder is to be prevented. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to 
an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 
"Intermittent" administration is treatment that is not consecutively done without interruption, but rather is 
cyclic in nature. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans,
domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, pigs,
goats, rabbits, etc. Preferably, the mammal is human.

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which
are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often
the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically
acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including
ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such
as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates
including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or
sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, polyethylene
glycol (PEG), and PLURONICS™.

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or
variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab')2, and Fv
fragments; diabodies; linear antibodies (Zapata et al., Protein Eng. 8(10): 1057-1062 [1995]); single-chain
antibody molecules; and multispecific antibodies formed from antibody fragments.

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab"
fragments, each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the
ability to crystallize readily. Pepsin treatment yields an F(ab')2 fragment that has two antigen-combining sites
and is still capable of cross-linking antigen.

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding
site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non-covalent
association. It is in this configuration that the three CDRs of each variable domain interact to define an
antigen-binding site on the surface of the VH-VL dimer. Collectively, the six CDRs confer antigen-binding
specificity to the antibody. However, even a single variable domain (or half of an Fv comprising only three
CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than
the entire binding site.

The Fab fragment also contains the constant domain of the light chain and the first constant domain
(CH1) of the heavy chain. Fab' fragments differ from Fab fragments by the addition of a few residues at the
carboxy terminus of the heavy chain CH1 domain including one or more cysteines from the antibody hinge
region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains
bear a free thiol group. F(ab')2 antibody fragments originally were produced as pairs of Fab' fragments which
have hinge cysteines between them. Other chemical couplings of antibody fragments are also known.

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to
one of two clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant 
domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins
can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG,
and IgM, and several of these may be further divided into subclasses (isotypes), e.g., IgG1, IgG2, IgG3, IgG4,
IgA1, and IgA2.

"Single-chain Fv" or "sFv" antibody fragments comprise the Vh and V,, domains of antibody, wherein 
these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a 
polypeptide linker between the Vf, and domains which enables the sFv to form the desired structure for 
antigen binding. For a review of sFv, see Pluckthun in The Pharmacoloev of Monoclonal Antibodies , vol. 
1 13, Rosenburg and Moore eds.. Springer- Verlag, New York, pp. 269-315 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which
fragments comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) in the
same polypeptide chain (VH-VL). By using a linker that is too short to allow pairing between the two domains
on the same chain, the domains are forced to pair with the complementary domains of another chain and create
two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/11161;
and Hollinger et al., Proc. Natl. Acad. Sci. USA, 90:6444-6448 (1993).

An "isolated" antibody is one which has been identified and separated and/or recovered from a
component of its natural environment. Contaminant components of its natural environment are materials which
would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and
other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1)
to greater than 95% by weight of antibody as determined by the Lowry method, and most preferably more than
99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid
sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or
nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the
antibody in situ within recombinant cells since at least one component of the antibody's natural environment
will not be present. Ordinarily, however, isolated antibody will be prepared by at least one purification step.

The word "label" when used herein refers to a detectable compound or composition which is 
conjugated directly or indirectly to the antibody so as to generate a "labeled" antibody. The label may be 
detectable by itself (e.g. radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may 
catalyze chemical alteration of a substrate compound or composition which is detectable. 

By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can 
adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass (e.g., 
controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and 
silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an assay 
plate; in others it is a purification column (e.g., an affinity chromatography column). This term also includes 
a discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 
which is useful for delivery of a drug (such as a PRO polypeptide or antibody thereto) to a mammal. The 
components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement 
of biological membranes. 

A "small molecule" is defined herein to have a molecular weight below about 500 Daltons. 

Table 1 



/*
 *
 * C-C increased from 12 to 15
 * Z is average of EQ
 * B is average of ND
 * match with stop is _M; stop-stop = 0; J (joker) match = 0
 */

#define _M      -8      /* value of a match with a stop */

int     _day[26][26] = {
/*        A  B  C  D  E  F  G  H  I  J  K  L  M  N  O  P  Q  R  S  T  U  V  W  X  Y  Z */
/* A */ { 2, 0,-2, 0, 0,-4, 1,-1,-1, 0,-1,-2,-1, 0,_M, 1, 0,-2, 1, 1, 0, 0,-6, 0,-3, 0},
/* B */ { 0, 3,-4, 3, 2,-5, 0, 1,-2, 0, 0,-3,-2, 2,_M,-1, 1, 0, 0, 0, 0,-2,-5, 0,-3, 1},
/* C */ {-2,-4,15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5,-4,_M,-3,-5,-4, 0,-2, 0,-2,-8, 0, 0,-5},
/* D */ { 0, 3,-5, 4, 3,-6, 1, 1,-2, 0, 0,-4,-3, 2,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 2},
/* E */ { 0, 2,-5, 3, 4,-5, 0, 1,-2, 0, 0,-3,-2, 1,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 3},
/* F */ {-4,-5,-4,-6,-5, 9,-5,-2, 1, 0,-5, 2, 0,-4,_M,-5,-5,-4,-3,-3, 0,-1, 0, 0, 7,-5},
/* G */ { 1, 0,-3, 1, 0,-5, 5,-2,-3, 0,-2,-4,-3, 0,_M,-1,-1,-3, 1, 0, 0,-1,-7, 0,-5, 0},
/* H */ {-1, 1,-3, 1, 1,-2,-2, 6,-2, 0, 0,-2,-2, 2,_M, 0, 3, 2,-1,-1, 0,-2,-3, 0, 0, 2},
/* I */ {-1,-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2, 2,-2,_M,-2,-2,-2,-1, 0, 0, 4,-5, 0,-1,-2},
/* J */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0},
/* K */ {-1, 0,-5, 0, 0,-5,-2, 0,-2, 0, 5,-3, 0, 1,_M,-1, 1, 3, 0, 0, 0,-2,-3, 0,-4, 0},
/* L */ {-2,-3,-6,-4,-3, 2,-4,-2, 2, 0,-3, 6, 4,-3,_M,-3,-2,-3,-3,-1, 0, 2,-2, 0,-1,-2},
/* M */ {-1,-2,-5,-3,-2, 0,-3,-2, 2, 0, 0, 4, 6,-2,_M,-2,-1, 0,-2,-1, 0, 2,-4, 0,-2,-1},
/* N */ { 0, 2,-4, 2, 1,-4, 0, 2,-2, 0, 1,-3,-2, 2,_M,-1, 1, 0, 1, 0, 0,-2,-4, 0,-2, 1},
/* O */ {_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M, 0,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M},
/* P */ { 1,-1,-3,-1,-1,-5,-1, 0,-2, 0,-1,-3,-2,-1,_M, 6, 0, 0, 1, 0, 0,-1,-6, 0,-5, 0},
/* Q */ { 0, 1,-5, 2, 2,-5,-1, 3,-2, 0, 1,-2,-1, 1,_M, 0, 4, 1,-1,-1, 0,-2,-5, 0,-4, 3},
/* R */ {-2, 0,-4,-1,-1,-4,-3, 2,-2, 0, 3,-3, 0, 0,_M, 0, 1, 6, 0,-1, 0,-2, 2, 0,-4, 0},
/* S */ { 1, 0, 0, 0, 0,-3, 1,-1,-1, 0, 0,-3,-2, 1,_M, 1,-1, 0, 2, 1, 0,-1,-2, 0,-3, 0},
/* T */ { 1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,_M, 0,-1,-1, 1, 3, 0, 0,-5, 0,-3, 0},
/* U */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0},
/* V */ { 0,-2,-2,-2,-2,-1,-1,-2, 4, 0,-2, 2, 2,-2,_M,-1,-2,-2,-1, 0, 0, 4,-6, 0,-2,-2},
/* W */ {-6,-5,-8,-7,-7, 0,-7,-3,-5, 0,-3,-2,-4,-4,_M,-6,-5, 2,-2,-5, 0,-6,17, 0, 0,-6},
/* X */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0},
/* Y */ {-3,-3, 0,-4,-4, 7,-5, 0,-1, 0,-4,-1,-2,-2,_M,-5,-4,-4,-3,-3, 0,-2, 0, 0,10,-4},
/* Z */ { 0, 1,-5, 2, 3,-5, 0, 2,-2, 0, 0,-2,-1, 1,_M, 0, 3, 0, 0, 0, 0,-2,-6, 0,-4, 4}
};
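
The scoring table above is indexed by letter, so a residue pair is scored by subtracting 'A' from each character. The following is a minimal sketch of that lookup in present-day C, not part of the patent listing; the function names pair_score and ungapped_score are hypothetical, and the table is passed in as a parameter so the sketch does not depend on the listing's globals.

```c
#include <ctype.h>

/* Hypothetical helper: score residues a and b against a 26 x 26 table
 * indexed by letter, in the same way the _day table is indexed.
 * Anything that is not a letter scores 0. */
int pair_score(int table[26][26], char a, char b)
{
    if (!isalpha((unsigned char)a) || !isalpha((unsigned char)b))
        return 0;
    return table[toupper((unsigned char)a) - 'A']
                [toupper((unsigned char)b) - 'A'];
}

/* Hypothetical helper: sum the pairwise scores of two ungapped strings,
 * stopping at the end of the shorter one. */
int ungapped_score(int table[26][26], const char *x, const char *y)
{
    int total = 0;

    while (*x && *y)
        total += pair_score(table, *x++, *y++);
    return total;
}
```

With the values listed in Table 1, pair_score(_day, 'C', 'C') would read the C-C entry, 15, and pair_score(_day, 'W', 'W') would read 17.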
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Table 1 rcont*> 



#include <stdio.h>
#include <ctype.h>

#define MAXJMP  16      /* max jumps in a diag */
#define MAXGAP  24      /* don't continue to penalize gaps larger than this */
#define JMPS    1024    /* max jmps in an path */
#define MX      4       /* save if there's at least MX-1 bases since last jmp */

#define DMAT    3       /* value of matching bases */
#define DMIS    0       /* penalty for mismatched bases */
#define DINS0   8       /* penalty for a gap */
#define DINS1   1       /* penalty per base */
#define PINS0   8       /* penalty for a gap */
#define PINS1   4       /* penalty per residue */

struct jmp {
    short           n[MAXJMP];  /* size of jmp (neg for dely) */
    unsigned short  x[MAXJMP];  /* base no. of jmp in seq x */
};                              /* limits seq to 2^16 -1 */

struct diag {
    int         score;          /* score at last jmp */
    long        offset;         /* offset of prev block */
    short       ijmp;           /* current jmp index */
    struct jmp  jp;             /* list of jmps */
};

struct path {
    int     spc;                /* number of leading spaces */
    short   n[JMPS];            /* size of jmp (gap) */
    int     x[JMPS];            /* loc of jmp (last elem before gap) */
};

char        *ofile;             /* output file name */
char        *namex[2];          /* seq names: getseqs() */
char        *prog;              /* prog name for err msgs */
char        *seqx[2];           /* seqs: getseqs() */
int         dmax;               /* best diag: nw() */
int         dmax0;              /* final diag */
int         dna;                /* set if dna: main() */
int         endgaps;            /* set if penalizing end gaps */
int         gapx, gapy;         /* total gaps in seqs */
int         len0, len1;         /* seq lens */
int         ngapx, ngapy;       /* total size of gaps */
int         smax;               /* max score: nw() */
int         *xbm;               /* bitmap for matching */
long        offset;             /* current offset in jmp file */
struct diag *dx;                /* holds diagonals */
struct path pp[2];              /* holds path for seqs */

char        *calloc(), *malloc(), *index(), *strcpy();
char        *getseq(), *g_calloc();
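
The gap constants in this header are combined by the nw() routine of the listing into an open-plus-extend penalty: a new gap is charged ins0 + ins1, each further element adds ins1, and once a gap reaches MAXGAP elements no more is added unless end gaps are being penalized. The sketch below restates that rule as a stand-alone function. It is an illustrative reading of the listing, not code from it, and the name gap_penalty and its parameters are assumptions.

```c
/* Values copied from the header in the listing. */
enum { MAXGAP = 24, DINS0 = 8, DINS1 = 1, PINS0 = 8, PINS1 = 4 };

/* Hypothetical helper: penalty for a single gap of `len` elements.
 * ins0 is charged once for opening the gap and ins1 once per element.
 * Unless end gaps are being penalized, elements beyond MAXGAP are free. */
int gap_penalty(int len, int is_dna, int endgaps)
{
    int ins0 = is_dna ? DINS0 : PINS0;
    int ins1 = is_dna ? DINS1 : PINS1;
    int charged;

    if (len <= 0)
        return 0;
    charged = (endgaps || len < MAXGAP) ? len : MAXGAP;
    return ins0 + ins1 * charged;
}
```

For example, a one-residue gap in a protein alignment costs 8 + 4 = 12, and a three-base gap in a DNA alignment costs 8 + 3 = 11.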
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T^ble 1 (cogf) 

/* Needleman-Wunsch alignment program
 *
 * usage: progs file1 file2
 *  where file1 and file2 are two dna or two protein sequences.
 *  The sequences can be in upper- or lower-case and may contain ambiguity
 *  Any lines beginning with ';', '>' or '<' are ignored
 *  Max file length is 65535 (limited by unsigned short x in the jmp struct)
 *  A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA
 *  Output is in the file "align.out"
 *
 * The program may create a tmp file in /tmp to hold info about traceback.
 * Original version developed under BSD 4.3 on a vax 8650
 */
#include "nw.h"
#include "day.h"

static  _dbval[26] = {
    1,14,2,13,0,0,4,11,0,0,12,0,3,15,0,0,0,5,6,8,8,7,9,0,10,0
};

static  _pbval[26] = {
    1, 2|(1<<('D'-'A'))|(1<<('N'-'A')), 4, 8, 16, 32, 64,
    128, 256, 0xFFFFFFF, 1<<10, 1<<11, 1<<12, 1<<13, 1<<14,
    1<<15, 1<<16, 1<<17, 1<<18, 1<<19, 1<<20, 1<<21, 1<<22,
    1<<23, 1<<24, 1<<25|(1<<('E'-'A'))|(1<<('Q'-'A'))
};

main(ac, av)
    int     ac;
    char    *av[];
{
    prog = av[0];
    if (ac != 3) {
        fprintf(stderr, "usage: %s file1 file2\n", prog);
        fprintf(stderr, "where file1 and file2 are two dna or two protein sequences.\n");
        fprintf(stderr, "The sequences can be in upper- or lower-case\n");
        fprintf(stderr, "Any lines beginning with ';' or '<' are ignored\n");
        fprintf(stderr, "Output is in the file \"align.out\"\n");
        exit(1);
    }
    namex[0] = av[1];
    namex[1] = av[2];
    seqx[0] = getseq(namex[0], &len0);
    seqx[1] = getseq(namex[1], &len1);
    xbm = (dna)? _dbval : _pbval;

    endgaps = 0;            /* 1 to penalize endgaps */
    ofile = "align.out";    /* output file */

    nw();                   /* fill in the matrix, get the possible jmps */
    readjmps();             /* get the actual jmps */
    print();                /* print stats, alignment */

    cleanup(0);             /* unlink any tmp files */
}

/* do the alignment, return best score: main()
 * dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983
 * pro: PAM 250 values
 * When scores are equal, we prefer mismatches to any gap, prefer
 * a new gap to extending an ongoing gap, and prefer a gap in seqx
 * to a gap in seq y.
 */
nw()
{
    char        *px, *py;       /* seqs and ptrs */
    int         *ndely, *dely;  /* keep track of dely */
    int         ndelx, delx;    /* keep track of delx */
    int         *tmp;           /* for swapping row0, row1 */
    int         mis;            /* score for each type */
    int         ins0, ins1;     /* insertion penalties */
    register    id;             /* diagonal index */
    register    ij;             /* jmp index */
    register    *col0, *col1;   /* score for curr, last row */
    register    xx, yy;         /* index into seqs */

    dx = (struct diag *)g_calloc("to get diags", len0+len1+1, sizeof(struct diag));

    ndely = (int *)g_calloc("to get ndely", len1+1, sizeof(int));
    dely = (int *)g_calloc("to get dely", len1+1, sizeof(int));
    col0 = (int *)g_calloc("to get col0", len1+1, sizeof(int));
    col1 = (int *)g_calloc("to get col1", len1+1, sizeof(int));
    ins0 = (dna)? DINS0 : PINS0;
    ins1 = (dna)? DINS1 : PINS1;

    smax = -10000;
    if (endgaps) {
        for (col0[0] = dely[0] = -ins0, yy = 1; yy <= len1; yy++) {
            col0[yy] = dely[yy] = col0[yy-1] - ins1;
            ndely[yy] = yy;
        }
        col0[0] = 0;    /* Waterman Bull Math Biol 84 */
    }
    else
        for (yy = 1; yy <= len1; yy++)
            dely[yy] = -ins0;

    /* fill in match matrix
     */
    for (px = seqx[0], xx = 1; xx <= len0; px++, xx++) {
        /* initialize first entry in col
         */
        if (endgaps) {
            if (xx == 1)
                col1[0] = delx = -(ins0+ins1);
            else
                col1[0] = delx = col0[0] - ins1;
            ndelx = xx;
        }
        else {
            col1[0] = 0;
            delx = -ins0;
            ndelx = 0;
        }

        for (py = seqx[1], yy = 1; yy <= len1; py++, yy++) {
            mis = col0[yy-1];
            if (dna)
                mis += (xbm[*px-'A']&xbm[*py-'A'])? DMAT : DMIS;
            else
                mis += _day[*px-'A'][*py-'A'];

            /* update penalty for del in x seq;
             * favor new del over ongoing del
             * ignore MAXGAP if weighting endgaps
             */
            if (endgaps || ndely[yy] < MAXGAP) {
                if (col0[yy] - ins0 >= dely[yy]) {
                    dely[yy] = col0[yy] - (ins0+ins1);
                    ndely[yy] = 1;
                } else {
                    dely[yy] -= ins1;
                    ndely[yy]++;
                }
            } else {
                if (col0[yy] - (ins0+ins1) >= dely[yy]) {
                    dely[yy] = col0[yy] - (ins0+ins1);
                    ndely[yy] = 1;
                } else
                    ndely[yy]++;
            }

            /* update penalty for del in y seq;
             * favor new del over ongoing del
             */
            if (endgaps || ndelx < MAXGAP) {
                if (col1[yy-1] - ins0 >= delx) {
                    delx = col1[yy-1] - (ins0+ins1);
                    ndelx = 1;
                } else {
                    delx -= ins1;
                    ndelx++;
                }
            } else {
                if (col1[yy-1] - (ins0+ins1) >= delx) {
                    delx = col1[yy-1] - (ins0+ins1);
                    ndelx = 1;
                } else
                    ndelx++;
            }

            /* pick the maximum score; we're favoring
             * mis over any del and delx over dely
             */
            id = xx - yy + len1 - 1;
            if (mis >= delx && mis >= dely[yy])
                col1[yy] = mis;
            else if (delx >= dely[yy]) {
                col1[yy] = delx;
                ij = dx[id].ijmp;
                if (dx[id].jp.n[0] && (!dna || (ndelx >= MAXJMP
                && xx > dx[id].jp.x[ij]+MX) || mis > dx[id].score+DINS0)) {
                    dx[id].ijmp++;
                    if (++ij >= MAXJMP) {
                        writejmps(id);
                        ij = dx[id].ijmp = 0;
                        dx[id].offset = offset;
                        offset += sizeof(struct jmp) + sizeof(offset);
                    }
                }
                dx[id].jp.n[ij] = ndelx;
                dx[id].jp.x[ij] = xx;
                dx[id].score = delx;
            }
            else {
                col1[yy] = dely[yy];
                ij = dx[id].ijmp;
                if (dx[id].jp.n[0] && (!dna || (ndely[yy] >= MAXJMP
                && xx > dx[id].jp.x[ij]+MX) || mis > dx[id].score+DINS0)) {
                    dx[id].ijmp++;
                    if (++ij >= MAXJMP) {
                        writejmps(id);
                        ij = dx[id].ijmp = 0;
                        dx[id].offset = offset;
                        offset += sizeof(struct jmp) + sizeof(offset);
                    }
                }
                dx[id].jp.n[ij] = -ndely[yy];
                dx[id].jp.x[ij] = xx;
                dx[id].score = dely[yy];
            }
            if (xx == len0 && yy < len1) {
                /* last col
                 */
                if (endgaps)
                    col1[yy] -= ins0+ins1*(len1-yy);
                if (col1[yy] > smax) {
                    smax = col1[yy];
                    dmax = id;
                }
            }
        }
        if (endgaps && xx < len0)
            col1[yy-1] -= ins0+ins1*(len0-xx);
        if (col1[yy-1] > smax) {
            smax = col1[yy-1];
            dmax = id;
        }
        tmp = col0; col0 = col1; col1 = tmp;
    }
    (void) free((char *)ndely);
    (void) free((char *)dely);
    (void) free((char *)col0);
    (void) free((char *)col1);
}
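
The nw() routine above keeps only two score columns at a time and swaps them after each pass. The same two-row idea can be shown on a much simpler problem. The sketch below is a textbook Needleman-Wunsch score computation with a single linear gap penalty; it deliberately omits the open/extend penalties, free end gaps, MAXGAP cap and jmp bookkeeping of the listing, so it will not reproduce ALIGN-2 scores. The name nw_score and its parameters are hypothetical.

```c
#include <stdlib.h>
#include <string.h>

static int max3(int a, int b, int c)
{
    int m = a > b ? a : b;
    return m > c ? m : c;
}

/* Hypothetical helper: global alignment score of x and y using
 * match/mismatch values and one linear gap penalty per gapped element.
 * Stores the score in *score and returns 0, or returns -1 if memory
 * could not be allocated. Only two rows of the matrix are kept. */
int nw_score(const char *x, const char *y,
             int match, int mismatch, int gap, int *score)
{
    size_t n = strlen(x), m = strlen(y), i, j;
    int *prev = malloc((m + 1) * sizeof *prev);
    int *curr = malloc((m + 1) * sizeof *curr);
    int *tmp;

    if (!prev || !curr) {
        free(prev);
        free(curr);
        return -1;
    }
    for (j = 0; j <= m; j++)            /* first row: all gaps */
        prev[j] = -(int)j * gap;
    for (i = 1; i <= n; i++) {
        curr[0] = -(int)i * gap;        /* first column: all gaps */
        for (j = 1; j <= m; j++) {
            int diag = prev[j - 1] + (x[i - 1] == y[j - 1] ? match : mismatch);
            curr[j] = max3(diag, prev[j] - gap, curr[j - 1] - gap);
        }
        tmp = prev; prev = curr; curr = tmp;   /* swap rows, as nw() does */
    }
    *score = prev[m];
    free(prev);
    free(curr);
    return 0;
}
```

Calling nw_score("ACGT", "AGT", 3, 0, 2, &s) aligns three matching bases around one gap, giving s = 3*3 - 2 = 7.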
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« 

/* print() -- only routine visible outside this module
 *
 * static:
 * getmat() -- trace back best path, count matches: print()
 * pr_align() -- print alignment of described in array p[]: print()
 * dumpblock() -- dump a block of lines with numbers, stars: pr_align()
 * nums() -- put out a number line: dumpblock()
 * putline() -- put out a line (name, [num], seq, [num]): dumpblock()
 * stars() -- put a line of stars: dumpblock()
 * stripname() -- strip any path and prefix from a seqname
 */

#include "nw.h"

#define SPC     3
#define P_LINE  256     /* maximum output line */
#define P_SPC   3       /* space between name or num and seq */

extern  _day[26][26];
int     olen;           /* set output line length */
FILE    *fx;            /* output file */

print()
{
    int     lx, ly, firstgap, lastgap;  /* overlap */

    if ((fx = fopen(ofile, "w")) == 0) {
        fprintf(stderr, "%s: can't write %s\n", prog, ofile);
        cleanup(1);
    }
    fprintf(fx, "<first sequence: %s (length = %d)\n", namex[0], len0);
    fprintf(fx, "<second sequence: %s (length = %d)\n", namex[1], len1);
    olen = 60;
    lx = len0;
    ly = len1;
    firstgap = lastgap = 0;
    if (dmax < len1 - 1) {          /* leading gap in x */
        pp[0].spc = firstgap = len1 - dmax - 1;
        ly -= pp[0].spc;
    }
    else if (dmax > len1 - 1) {     /* leading gap in y */
        pp[1].spc = firstgap = dmax - (len1 - 1);
        lx -= pp[1].spc;
    }
    if (dmax0 < len0 - 1) {         /* trailing gap in x */
        lastgap = len0 - dmax0 - 1;
        lx -= lastgap;
    }
    else if (dmax0 > len0 - 1) {    /* trailing gap in y */
        lastgap = dmax0 - (len0 - 1);
        ly -= lastgap;
    }
    getmat(lx, ly, firstgap, lastgap);
    pr_align();
}

/*
 * trace back the best path, count matches
 */
static
getmat(lx, ly, firstgap, lastgap)
    int     lx, ly;                 /* "core" (minus endgaps) */
    int     firstgap, lastgap;      /* leading trailing overlap */
{
    int             nm, i0, i1, siz0, siz1;
    char            outx[32];
    double          pct;
    register        n0, n1;
    register char   *p0, *p1;

    /* get total matches, score
     */
    i0 = i1 = siz0 = siz1 = 0;
    p0 = seqx[0] + pp[1].spc;
    p1 = seqx[1] + pp[0].spc;
    n0 = pp[1].spc + 1;
    n1 = pp[0].spc + 1;

    nm = 0;
    while (*p0 && *p1) {
        if (siz0) {
            p1++;
            n1++;
            siz0--;
        }
        else if (siz1) {
            p0++;
            n0++;
            siz1--;
        }
        else {
            if (xbm[*p0-'A']&xbm[*p1-'A'])
                nm++;
            if (n0++ == pp[0].x[i0])
                siz0 = pp[0].n[i0++];
            if (n1++ == pp[1].x[i1])
                siz1 = pp[1].n[i1++];
            p0++;
            p1++;
        }
    }

    /* pct homology:
     * if penalizing endgaps, base is the shorter seq
     * else, knock off overhangs and take shorter core
     */
    if (endgaps)
        lx = (len0 < len1)? len0 : len1;
    else
        lx = (lx < ly)? lx : ly;
    pct = 100.*(double)nm/(double)lx;
    fprintf(fx, "\n");
    fprintf(fx, "<%d match%s in an overlap of %d: %.2f percent similarity\n",
        nm, (nm == 1)? "" : "es", lx, pct);

    fprintf(fx, "<gaps in first sequence: %d", gapx);
    if (gapx) {
        (void) sprintf(outx, " (%d %s%s)",
            ngapx, (dna)? "base":"residue", (ngapx == 1)? "":"s");
        fprintf(fx, "%s", outx);
    }

    fprintf(fx, ", gaps in second sequence: %d", gapy);
    if (gapy) {
        (void) sprintf(outx, " (%d %s%s)",
            ngapy, (dna)? "base":"residue", (ngapy == 1)? "":"s");
        fprintf(fx, "%s", outx);
    }
    if (dna)
        fprintf(fx,
        "\n<score: %d (match = %d, mismatch = %d, gap penalty = %d + %d per base)\n",
        smax, DMAT, DMIS, DINS0, DINS1);
    else
        fprintf(fx,
        "\n<score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per residue)\n",
        smax, PINS0, PINS1);
    if (endgaps)
        fprintf(fx,
        "<endgaps penalized. left endgap: %d %s%s, right endgap: %d %s%s\n",
        firstgap, (dna)? "base" : "residue", (firstgap == 1)? "" : "s",
        lastgap, (dna)? "base" : "residue", (lastgap == 1)? "" : "s");
    else
        fprintf(fx, "<endgaps not penalized\n");
}

static  nm;                         /* matches in core -- for checking */
static  lmax;                       /* lengths of stripped file names */
static  ij[2];                      /* jmp index for a path */
static  nc[2];                      /* number at start of current line */
static  ni[2];                      /* current elem number -- for gapping */
static  siz[2];
static char *ps[2];                 /* ptr to current element */
static char *po[2];                 /* ptr to next output char slot */
static char out[2][P_LINE];         /* output line */
static char star[P_LINE];           /* set by stars() */

/*
 * print alignment of described in struct path pp[]
 */
static
pr_align()
{
    int         nn;     /* char count */
    int         more;
    register    i;

    for (i = 0, lmax = 0; i < 2; i++) {
        nn = stripname(namex[i]);
        if (nn > lmax)
            lmax = nn;

        nc[i] = 1;
        ni[i] = 1;
        siz[i] = ij[i] = 0;
        ps[i] = seqx[i];
        po[i] = out[i];
    }

    for (nn = nm = 0, more = 1; more; ) {
        for (i = more = 0; i < 2; i++) {
            /*
             * do we have more of this sequence?
             */
            if (!*ps[i])
                continue;

            more++;

            if (pp[i].spc) {        /* leading space */
                *po[i]++ = ' ';
                pp[i].spc--;
            }
            else if (siz[i]) {      /* in a gap */
                *po[i]++ = '-';
                siz[i]--;
            }
            else {                  /* we're putting a seq element
                                     */
                *po[i] = *ps[i];
                if (islower(*ps[i]))
                    *ps[i] = toupper(*ps[i]);
                po[i]++;
                ps[i]++;

                /*
                 * are we at next gap for this seq?
                 */
                if (ni[i] == pp[i].x[ij[i]]) {
                    /*
                     * we need to merge all gaps
                     * at this location
                     */
                    siz[i] = pp[i].n[ij[i]++];
                    while (ni[i] == pp[i].x[ij[i]])
                        siz[i] += pp[i].n[ij[i]++];
                }
                ni[i]++;
            }
        }
        if (++nn == olen || !more && nn) {
            dumpblock();
            for (i = 0; i < 2; i++)
                po[i] = out[i];
            nn = 0;
        }
    }
}

/*
 * dump a block of lines, including numbers, stars: pr_align()
 */
static
dumpblock()
{
    register    i;

    for (i = 0; i < 2; i++)
        *po[i]-- = '\0';

    (void) putc('\n', fx);
    for (i = 0; i < 2; i++) {
        if (*out[i] && (*out[i] != ' ' || *(po[i]) != ' ')) {
            if (i == 0)
                nums(i);
            if (i == 0 && *out[1])
                stars();
            putline(i);
            if (i == 0 && *out[1])
                fprintf(fx, star);
            if (i == 1)
                nums(i);
        }
    }
}

/*
 * put out a number line: dumpblock()
 */
static
nums(ix)
    int     ix;     /* index in out[] holding seq line */
{
    char            nline[P_LINE];
    register        i, j;
    register char   *pn, *px, *py;

    for (pn = nline, i = 0; i < lmax+P_SPC; i++, pn++)
        *pn = ' ';
    for (i = nc[ix], py = out[ix]; *py; py++, pn++) {
        if (*py == ' ' || *py == '-')
            *pn = ' ';
        else {
            if (i%10 == 0 || (i == 1 && nc[ix] != 1)) {
                j = (i < 0)? -i : i;
                for (px = pn; j; j /= 10, px--)
                    *px = j%10 + '0';
                if (i < 0)
                    *px = '-';
            }
            else
                *pn = ' ';
            i++;
        }
    }
    *pn = '\0';
    nc[ix] = i;
    for (pn = nline; *pn; pn++)
        (void) putc(*pn, fx);
    (void) putc('\n', fx);
}

/*
 * put out a line (name, [num], seq, [num]): dumpblock()
 */
static
putline(ix)
    int     ix;
{
    int             i;
    register char   *px;

    for (px = namex[ix], i = 0; *px && *px != ':'; px++, i++)
        (void) putc(*px, fx);
    for (; i < lmax+P_SPC; i++)
        (void) putc(' ', fx);

    /* these count from 1:
     * ni[] is current element (from 1)
     * nc[] is number at start of current line
     */
    for (px = out[ix]; *px; px++)
        (void) putc(*px&0x7F, fx);
    (void) putc('\n', fx);
}

/*
 * put a line of stars (seqs always in out[0], out[1]): dumpblock()
 */
static
stars()
{
    int             i;
    register char   *p0, *p1, cx, *px;

    if (!*out[0] || (*out[0] == ' ' && *(po[0]) == ' ') ||
        !*out[1] || (*out[1] == ' ' && *(po[1]) == ' '))
        return;
    px = star;
    for (i = lmax+P_SPC; i; i--)
        *px++ = ' ';

    for (p0 = out[0], p1 = out[1]; *p0 && *p1; p0++, p1++) {
        if (isalpha(*p0) && isalpha(*p1)) {

            if (xbm[*p0-'A']&xbm[*p1-'A']) {
                cx = '*';
                nm++;
            }
            else if (!dna && _day[*p0-'A'][*p1-'A'] > 0)
                cx = '.';
            else
                cx = ' ';
        }
        else
            cx = ' ';
        *px++ = cx;
    }
    *px++ = '\n';
    *px = '\0';
}

/*
 * strip path or prefix from pn, return len: pr_align()
 */
static
stripname(pn)
    char    *pn;    /* file name (may be path) */
{
    register char   *px, *py;

    py = 0;
    for (px = pn; *px; px++)
        if (*px == '/')
            py = px + 1;
    if (py)
        (void) strcpy(pn, py);
    return(strlen(pn));
}
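
The stars() routine above decides whether two aligned characters match by ANDing their entries in the xbm bitmap; for DNA that bitmap is the _dbval table, in which A, C, G and T/U each own one bit and every ambiguity code is the union of the bases it can stand for. The sketch below isolates that test. It is an illustration in present-day C rather than part of the listing: the names dna_bits, codes_match and mark_matches are hypothetical, and the '.' marker that stars() uses for positive PAM scores is left out.

```c
#include <ctype.h>

/* Bit masks copied from the _dbval table in the listing:
 * A=1, C=2, G=4, T=U=8; e.g. R (A or G) = 5, N (any base) = 15. */
static const int dna_bits[26] = {
    1, 14, 2, 13, 0, 0, 4, 11, 0, 0, 12, 0, 3,
    15, 0, 0, 0, 5, 6, 8, 8, 7, 9, 0, 10, 0
};

/* Hypothetical helper: two nucleotide codes match when their masks
 * share at least one base. Non-letters (such as '-') never match. */
int codes_match(char a, char b)
{
    if (!isalpha((unsigned char)a) || !isalpha((unsigned char)b))
        return 0;
    return (dna_bits[toupper((unsigned char)a) - 'A'] &
            dna_bits[toupper((unsigned char)b) - 'A']) != 0;
}

/* Hypothetical helper: write '*' under matching columns and ' '
 * elsewhere, and return the number of matches. `out` must have room
 * for the shorter input line plus a terminating NUL. */
int mark_matches(const char *top, const char *bottom, char *out)
{
    int n = 0;

    for (; *top && *bottom; top++, bottom++) {
        int m = codes_match(*top, *bottom);
        *out++ = m ? '*' : ' ';
        n += m;
    }
    *out = '\0';
    return n;
}
```

For the aligned lines "ARG-T" and "AAGAT", R matches A because both masks contain the A bit, while the gap column is left blank.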
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Table 1 (cQnf) 

/*
 * cleanup() -- cleanup any tmp file
 * getseq() -- read in seq, set dna, len, maxlen
 * g_calloc() -- calloc() with error checkin
 * readjmps() -- get the good jmps, from tmp file if necessary
 * writejmps() -- write a filled array of jmps to a tmp file: nw()
 */
#include "nw.h"
#include <sys/file.h>

char    *jname = "/tmp/homgXXXXXX";     /* tmp file for jmps */
FILE    *fj;

int     cleanup();                      /* cleanup tmp file */
long    lseek();

/*
 * remove any tmp file if we blow
 */
cleanup(i)
    int     i;
{
    if (fj)
        (void) unlink(jname);
    exit(i);
}

/*
 * read, return ptr to seq, set dna, len, maxlen
 * skip lines starting with ';', '<', or '>'
 * seq in upper or lower case
 */
char *
getseq(file, len)
    char    *file;  /* file name */
    int     *len;   /* seq len */
{
    char            line[1024], *pseq;
    register char   *px, *py;
    int             natgc, tlen;
    FILE            *fp;

    if ((fp = fopen(file, "r")) == 0) {
        fprintf(stderr, "%s: can't read %s\n", prog, file);
        exit(1);
    }
    tlen = natgc = 0;
    while (fgets(line, 1024, fp)) {
        if (*line == ';' || *line == '<' || *line == '>')
            continue;
        for (px = line; *px != '\n'; px++)
            if (isupper(*px) || islower(*px))
                tlen++;
    }
    if ((pseq = malloc((unsigned)(tlen+6))) == 0) {
        fprintf(stderr, "%s: malloc() failed to get %d bytes for %s\n", prog, tlen+6, file);
        exit(1);
    }
    pseq[0] = pseq[1] = pseq[2] = pseq[3] = '\0';

    py = pseq + 4;
    *len = tlen;
    rewind(fp);

    while (fgets(line, 1024, fp)) {
        if (*line == ';' || *line == '<' || *line == '>')
            continue;
        for (px = line; *px != '\n'; px++) {
            if (isupper(*px))
                *py++ = *px;
            else if (islower(*px))
                *py++ = toupper(*px);
            if (index("ATGCU", *(py-1)))
                natgc++;
        }
    }
    *py++ = '\0';
    *py = '\0';
    (void) fclose(fp);
    dna = natgc > (tlen/3);
    return(pseq+4);
}

char *
g_calloc(msg, nx, sz)
    char    *msg;       /* program, calling routine */
    int     nx, sz;     /* number and size of elements */
{
    char    *px, *calloc();

    if ((px = calloc((unsigned)nx, (unsigned)sz)) == 0) {
        if (*msg) {
            fprintf(stderr, "%s: g_calloc() failed %s (n=%d, sz=%d)\n", prog, msg, nx, sz);
            exit(1);
        }
    }
    return(px);
}

/*
 * get final jmps from dx[] or tmp file, set pp[], reset dmax: main()
 */
readjmps()
{
    int         fd = -1;
    int         siz, i0, i1;
    register    i, j, xx;

    if (fj) {
        (void) fclose(fj);
        if ((fd = open(jname, O_RDONLY, 0)) < 0) {
            fprintf(stderr, "%s: can't open() %s\n", prog, jname);
            cleanup(1);
        }
    }
    for (i = i0 = i1 = 0, dmax0 = dmax, xx = len0; ; i++) {
        while (1) {
            for (j = dx[dmax].ijmp; j >= 0 && dx[dmax].jp.x[j] >= xx; j--)
                ;
            if (j < 0 && dx[dmax].offset && fj) {
                (void) lseek(fd, dx[dmax].offset, 0);
                (void) read(fd, (char *)&dx[dmax].jp, sizeof(struct jmp));
                (void) read(fd, (char *)&dx[dmax].offset, sizeof(dx[dmax].offset));
                dx[dmax].ijmp = MAXJMP-1;
            }
            else
                break;
        }
        if (i >= JMPS) {
            fprintf(stderr, "%s: too many gaps in alignment\n", prog);
            cleanup(1);
        }
        if (j >= 0) {
            siz = dx[dmax].jp.n[j];
            xx = dx[dmax].jp.x[j];
            dmax += siz;
            if (siz < 0) {          /* gap in second seq */
                pp[1].n[i1] = -siz;
                xx += siz;
                /* id = xx - yy + len1 - 1
                 */
                pp[1].x[i1] = xx - dmax + len1 - 1;
                gapy++;
                ngapy -= siz;
                /* ignore MAXGAP when doing endgaps */
                siz = (-siz < MAXGAP || endgaps)? -siz : MAXGAP;
                i1++;
            }
            else if (siz > 0) {     /* gap in first seq */
                pp[0].n[i0] = siz;
                pp[0].x[i0] = xx;
                gapx++;
                ngapx += siz;
                /* ignore MAXGAP when doing endgaps */
                siz = (siz < MAXGAP || endgaps)? siz : MAXGAP;
                i0++;
            }
        }
        else
            break;
    }

    /* reverse the order of jmps
     */
    for (j = 0, i0--; j < i0; j++, i0--) {
        i = pp[0].n[j]; pp[0].n[j] = pp[0].n[i0]; pp[0].n[i0] = i;
        i = pp[0].x[j]; pp[0].x[j] = pp[0].x[i0]; pp[0].x[i0] = i;
    }
    for (j = 0, i1--; j < i1; j++, i1--) {
        i = pp[1].n[j]; pp[1].n[j] = pp[1].n[i1]; pp[1].n[i1] = i;
        i = pp[1].x[j]; pp[1].x[j] = pp[1].x[i1]; pp[1].x[i1] = i;
    }
    if (fd >= 0)
        (void) close(fd);
    if (fj) {
        (void) unlink(jname);
        fj = 0;
        offset = 0;
    }
}

/*
 * write a filled jmp struct offset of the prev one (if any): nw()
 */
writejmps(ix)
    int     ix;
{
    char    *mktemp();

    if (!fj) {
        if (mktemp(jname) < 0) {
            fprintf(stderr, "%s: can't mktemp() %s\n", prog, jname);
            cleanup(1);
        }
        if ((fj = fopen(jname, "w")) == 0) {
            fprintf(stderr, "%s: can't write %s\n", prog, jname);
            exit(1);
        }
    }
    (void) fwrite((char *)&dx[ix].jp, sizeof(struct jmp), 1, fj);
    (void) fwrite((char *)&dx[ix].offset, sizeof(dx[ix].offset), 1, fj);
}

Table 2 

PRO XXXXXXXXXXXXXXX (Length = 15 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 

5 divided by 15 = 33.3% 
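
The calculation in Table 2 is a single division: identical matches over the length of the PRO sequence, expressed as a percentage. A minimal sketch follows. The function name percent_identity is hypothetical, and the match count is taken as an input because in the specification it is determined by ALIGN-2.

```c
/* Hypothetical helper: percent sequence identity as laid out in Table 2,
 * i.e. identical matches divided by the length of the PRO (reference)
 * sequence, times 100. Returns 0.0 when pro_len is not positive. */
double percent_identity(int matches, int pro_len)
{
    if (pro_len <= 0)
        return 0.0;
    return 100.0 * (double)matches / (double)pro_len;
}
```

With the Table 2 values, percent_identity(5, 15) evaluates to 33.3 when rounded to one decimal place. Note that the denominator is the PRO length (15), not the comparison protein length (12).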
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Table 3 
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PRO XXXXXXXXXX (Length « 10 amino acids) 

Comparison Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 

5 divided by 10 = 50% 

Table 4 

PRO-DNA NNNNNNNNNNNNNN (Length = 14 nucleotides) 

Comparison DNA NNNNNNLLLLLLLLLL (Length = 16 nucleotides) 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 

6 divided by 14 = 42.9% 

Table 5 

PRO-DNA NNNNNNNNNNNN (Length = 12 nucleotides) 

Comparison DNA NNNNLLLVV (Length = 9 nucleotides) 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 

4 divided by 12 = 33.3% 

II. Compositions and Methods of the Invention 

A. Full-Length PRO Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as PRO polypeptides. In particular, cDNAs encoding 
various PRO polypeptides have been identified and isolated, as disclosed in further detail in the Examples 
below. It is noted that proteins produced in separate expression rounds may be given different PRO numbers 
but the UNQ number is unique for any given DNA and the encoded protein, and will not be changed. 
However, for sake of simplicity, in the present specification the protein encoded by the full length native 
nucleic acid molecules disclosed herein as well as all further native homologues and variants included in the 
foregoing definition of PRO, will be referred to as "PRO/number", regardless of their origin or mode of 
preparation. 

As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The 
actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of 
the deposited clone using routine methods in the art. The predicted amino acid sequence can be determined 
from the nucleotide sequence using routine skill. For the PRO polypeptides and encoding nucleic acids 
described herein, Applicants have identified what is believed to be the reading frame best identifiable with the 
sequence information available at the time. 
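
As an illustration of how a predicted amino acid sequence follows from a nucleotide sequence once a reading frame is chosen, the sketch below translates a coding strand using the standard genetic code. It is not part of the specification: the names CODE, base_index and translate are hypothetical, only the first reading frame is handled, and choosing among frames (which the paragraph above describes as a matter of judgment) is outside its scope.

```c
#include <ctype.h>
#include <stddef.h>

/* Standard genetic code, with the bases at each codon position
 * ordered T, C, A, G ('*' marks a stop codon). */
static const char CODE[] =
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG";

/* Map a base to 0..3 (T or U, C, A, G), or -1 for anything else. */
static int base_index(char b)
{
    switch (toupper((unsigned char)b)) {
    case 'T': case 'U': return 0;
    case 'C': return 1;
    case 'A': return 2;
    case 'G': return 3;
    default:  return -1;
    }
}

/* Hypothetical helper: translate nt, starting at its first base, into
 * one-letter amino acid codes in out. Translation stops after the first
 * stop codon, when fewer than three bases remain, or when out is full.
 * Codons containing other characters become 'X'. Returns the number of
 * characters written, not counting the terminating NUL. */
size_t translate(const char *nt, char *out, size_t out_size)
{
    size_t n = 0;

    if (out_size == 0)
        return 0;
    while (nt[0] && nt[1] && nt[2] && n + 1 < out_size) {
        int a = base_index(nt[0]);
        int b = base_index(nt[1]);
        int c = base_index(nt[2]);
        char aa = (a < 0 || b < 0 || c < 0) ? 'X' : CODE[16 * a + 4 * b + c];

        out[n++] = aa;
        if (aa == '*')
            break;
        nt += 3;
    }
    out[n] = '\0';
    return n;
}
```

For instance, translate("ATGGCCTAAGGG", buf, sizeof buf) writes "MA*" and stops at the TAA stop codon.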

1. Full-length PRO281 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as PRO281 (UNQ244). In particular, cDNA encoding a 
PRO281 polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST-2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO281 (shown in Figure 2 and SEQ ID NO:2) has certain amino acid sequence identity with 
the rat TEGT protein. Accordingly, it is presently believed that PRO281 disclosed in the present application 
is a newly identified TEGT homolog and may possess activity typical of that protein. 

2. Full-length PRO276 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as PRO276 (UNQ243). In particular, cDNA encoding a 
PRO276 polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA16435-1208 sequence encodes a novel factor designated herein as 
PRO276; using WU-BLAST-2 sequence alignment computer programs, no significant sequence identities to 
any known proteins were revealed. The sequence identity identifications which were found are listed below 
in the examples. 

3. Full-length PRO189 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 

74. Full-length PRO812 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as PRO812. In particular, cDNA encoding a PRO812 
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST-2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO812 (shown in Figure 178 and SEQ ID NO:260) has amino acid sequence identity with 
the prostatic steroid-binding c1 protein. Accordingly, it is presently believed that PRO812 disclosed in the 
present application is a newly identified prostatic steroid-binding c1 protein homolog. 

75. Full-length PRO1069 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as PRO1069. In particular, Applicants have identified and
isolated cDNA encoding a PRO1069 polypeptide, as disclosed in further detail in the Examples below. Using
BLAST and FastA sequence alignment computer programs, it was found that the PRO1069 polypeptide has 
sequence identity with CHIP. Accordingly, it is presently believed that PRO1069 polypeptide disclosed in the 
present application is a newly identified CHIP polypeptide and is involved in ion conductance or regulation 
of ion conductance. 

76. Full-length PRO1129 Polypeptides

The present invention provides newly identified and isolated nucleotide sequences encoding
polypeptides referred to in the present application as PRO1129. In particular, Applicants have identified and
isolated cDNA encoding a PRO1129 polypeptide, as disclosed in further detail in the Examples below. Using
BLAST and FastA sequence alignment computer programs, Applicants found that the PRO1129 polypeptide
has significant similarity to the cytochrome P-450 family of proteins. Accordingly, it is presently believed that
PRO1129 polypeptide disclosed in the present application is a newly identified member of the cytochrome
P-450 family and possesses activity typical of that family.

77. Full-length PRO1068 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as PRO1068. In particular, cDNA encoding a PRO1068
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below.

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native
sequence PRO1068 has amino acid sequence identity with urotensin. Accordingly, it is presently believed that
PRO1068 disclosed in the present application is a newly identified member of the urotensin family and may
possess activity typical of the urotensin family. 

78. Full-length PRQW6 Polypeptides

The present invention provides newly identified and isolated nucleotide sequences encoding

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native
sequence PRO1384 (shown in Figure 306 and SEQ ID NO:424) has amino acid sequence identity with
NKG2-D (AF054819_1; Dayhoff database, version 35.45 SwissProt 35). Accordingly, it is presently believed that
PRO1384 disclosed in the present application is a newly identified member of the NKG2 family and may
possess MHC activation/inactivation activities typical of the NKG2 family.

B. PRO Polypeptide Variants

In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated 
that PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide 
changes into the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the art will
appreciate that amino acid changes may alter post-translational processes of the PRO, such as changing the
number or position of glycosylation sites or altering the membrane anchoring characteristics.

Variations in the native full-length sequence PRO or in various domains of the PRO described herein
can be made, for example, using any of the techniques and guidelines for conservative and non-conservative
mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or
insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the
PRO as compared with the native sequence PRO. Optionally the variation is by substitution of at least one
amino acid with any other amino acid in one or more of the domains of the PRO. Guidance in determining
which amino acid residue may be inserted, substituted or deleted without adversely affecting the desired activity
may be found by comparing the sequence of the PRO with that of homologous known protein molecules and
minimizing the number of amino acid sequence changes made in regions of high homology. Amino acid
substitutions can be the result of replacing one amino acid with another amino acid having similar structural
and/or chemical properties, such as the replacement of a leucine with an isoleucine, i.e., conservative amino acid
replacements. Insertions or deletions may optionally be in the range of about 1 to 5 amino acids. The variation
allowed may be determined by systematically making insertions, deletions or substitutions of amino acids in
the sequence and testing the resulting variants for activity exhibited by the full-length or mature native
sequence.

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus 
or C-terminus, or may lack internal residues, for example, when compared with a full length native protein. 
Certain fragments lack amino acid residues that are not essential for a desired biological activity of the PRO 
polypeptide.

PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide 
fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by 
enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by 
particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the 
desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding
a desired polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired
termini of the DNA fragment are employed as the 5' and 3' primers in the PCR. Preferably, PRO polypeptide
fragments share at least one biological and/or immunological activity with the native PRO polypeptide disclosed
herein.
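
By way of illustration only, the use of oligonucleotides that define the termini of a desired DNA fragment can be sketched as follows. This is a minimal sketch under stated assumptions: the function names, the 0-based coordinates, and the default primer length are hypothetical choices made for the example, and practical primer design would also weigh melting temperature, secondary structure and specificity, none of which is modeled here.

```python
# Illustrative sketch only: names, coordinates and primer length are assumptions.

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def design_primers(template: str, start: int, end: int, primer_len: int = 20):
    """Return (forward, reverse) primers bounding template[start:end].

    start is 0-based and end is exclusive. The forward primer matches the
    5' end of the fragment on the coding strand; the reverse primer is the
    reverse complement of the 3' end of the fragment.
    """
    fragment = template[start:end].upper()
    if len(fragment) < primer_len:
        raise ValueError("fragment is shorter than the requested primer length")
    forward = fragment[:primer_len]
    reverse = reverse_complement(fragment[-primer_len:])
    return forward, reverse


if __name__ == "__main__":
    template = "ATGGCCAAGCTTGGATCCGAATTCTAA"  # made-up sequence
    print(design_primers(template, 3, 24, primer_len=6))
```

The fragment amplified with such a primer pair begins and ends at the chosen coordinates, which is the sense in which the primers "define the desired termini".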

In particular embodiments, conservative substitutions of interest are shown in Table 6 under the 
heading of preferred substitutions. If such substitutions result in a change in biological activity, then more 
substantial changes, denominated exemplary substitutions in Table 6, or as further described below in reference 
to amino acid classes, are introduced and the products screened. 

Table 6

Original     Exemplary                               Preferred
Residue      Substitutions                           Substitutions

Ala (A)      val; leu; ile                           val
Arg (R)      lys; gln; asn                           lys
Asn (N)      gln; his; lys; arg                      gln
Asp (D)      glu                                     glu
Cys (C)      ser                                     ser
Gln (Q)      asn                                     asn
Glu (E)      asp                                     asp
Gly (G)      pro; ala                                ala
His (H)      asn; gln; lys; arg                      arg
Ile (I)      leu; val; met; ala; phe; norleucine     leu
Leu (L)      norleucine; ile; val; met; ala; phe     ile
Lys (K)      arg; gln; asn                           arg
Met (M)      leu; phe; ile                           leu
Phe (F)      leu; val; ile; ala; tyr                 leu
Pro (P)      ala                                     ala
Ser (S)      thr                                     thr
Thr (T)      ser                                     ser
Trp (W)      tyr; phe                                tyr
Tyr (Y)      trp; phe; thr; ser                      phe
Val (V)      ile; leu; met; phe; ala; norleucine     leu
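
For illustration, the preferred substitutions of Table 6 can be held in a simple lookup and used to enumerate single-substitution variants of a sequence. The sketch below is a hypothetical helper, not a disclosed method: the one-letter encoding, the function name and the variant label format are assumptions, and each generated variant would still have to be made and screened for activity as described in the text.

```python
# Preferred substitutions of Table 6, keyed by one-letter code.
# The one-letter encoding and function name are assumptions of this sketch.
PREFERRED = {
    "A": "V", "R": "K", "N": "Q", "D": "E", "C": "S",
    "Q": "N", "E": "D", "G": "A", "H": "R", "I": "L",
    "L": "I", "K": "R", "M": "L", "F": "L", "P": "A",
    "S": "T", "T": "S", "W": "Y", "Y": "F", "V": "L",
}


def preferred_variants(seq: str):
    """Yield (position, original, replacement, variant) for each residue.

    Positions are 1-based. Each variant differs from seq at exactly one
    position, where the Table 6 preferred substitution has been applied.
    """
    seq = seq.upper()
    for i, residue in enumerate(seq):
        replacement = PREFERRED[residue]
        yield i + 1, residue, replacement, seq[:i] + replacement + seq[i + 1:]


if __name__ == "__main__":
    for pos, old, new, variant in preferred_variants("MKT"):
        print(f"{old}{pos}{new}: {variant}")
```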



Substantial modifications in function or immunological identity of the PRO polypeptide are
accomplished by selecting substitutions that differ significantly in their effect on maintaining (a) the structure 
of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, 
(b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally 
occurring residues are divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gln, his, lys, arg;

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe.

Non-conservative substitutions will entail exchanging a member of one of these classes for another
class. Such substituted residues also may be introduced into the conservative substitution sites or, more 
preferably, into the remaining (non-conserved) sites. 
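
The class-based definition above lends itself to a short check. The following sketch encodes the six side-chain classes exactly as listed and reports whether a proposed exchange crosses classes; the three-letter keys, the abbreviation "nle" for norleucine, and the function name are assumptions made for the example.

```python
# Side-chain classes as listed in the text; "nle" (norleucine) is an
# abbreviation assumed for this sketch.
CLASSES = {
    "hydrophobic": {"nle", "met", "ala", "val", "leu", "ile"},
    "neutral hydrophilic": {"cys", "ser", "thr"},
    "acidic": {"asp", "glu"},
    "basic": {"asn", "gln", "his", "lys", "arg"},
    "chain orientation": {"gly", "pro"},
    "aromatic": {"trp", "tyr", "phe"},
}

# Invert the mapping so each residue points to its class name.
RESIDUE_CLASS = {
    residue: name for name, members in CLASSES.items() for residue in members
}


def is_non_conservative(original: str, replacement: str) -> bool:
    """Return True if the two residues belong to different classes."""
    return RESIDUE_CLASS[original.lower()] != RESIDUE_CLASS[replacement.lower()]


if __name__ == "__main__":
    print(is_non_conservative("leu", "ser"))  # different classes
    print(is_non_conservative("asp", "glu"))  # same class
```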

The variations can be made using methods known in the art such as oligonucleotide-mediated
(site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al., Nucl.
Acids Res., 13:4331 (1986); Zoller et al., Nucl. Acids Res., 10:6487 (1987)], cassette mutagenesis [Wells et
al., Gene, 34:315 (1985)], restriction selection mutagenesis [Wells et al., Philos. Trans. R. Soc. London SerA,
317:415 (1986)] or other known techniques can be performed on the cloned DNA to produce the PRO variant
DNA.

Scanning amino acid analysis can also be employed to identify one or more amino acids along a
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids.
Such amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning
amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to
alter the main-chain conformation of the variant [Cunningham and Wells, Science, 244:1081-1085 (1989)].
Alanine is also typically preferred because it is the most common amino acid. Further, it is frequently found
in both buried and exposed positions [Creighton, The Proteins, (W.H. Freeman & Co., N.Y.); Chothia, J.
Mol. Biol., 150:1 (1976)]. If alanine substitution does not yield adequate amounts of variant, an isosteric amino
acid can be used.
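
As a hedged illustration of alanine scanning along a contiguous sequence, the sketch below generates one variant per non-alanine position within a chosen window. The function name, the 1-based window arguments and the "L2A" label format are assumptions of the example; producing and assaying the variants is outside its scope.

```python
def alanine_scan(seq: str, first: int = 1, last=None):
    """Return [(label, variant), ...] for positions first..last (1-based, inclusive).

    Each variant replaces a single residue with alanine. Positions that
    are already alanine are skipped because no change would result.
    """
    seq = seq.upper()
    if last is None:
        last = len(seq)
    variants = []
    for pos in range(first, last + 1):
        residue = seq[pos - 1]
        if residue == "A":
            continue
        label = f"{residue}{pos}A"
        variants.append((label, seq[:pos - 1] + "A" + seq[pos:]))
    return variants


if __name__ == "__main__":
    for label, variant in alanine_scan("MLAK"):
        print(label, variant)
```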

C. Modifications of PRO 

Covalent modifications of PRO are included within the scope of this invention. One type of covalent
modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic derivatizing
agent that is capable of reacting with selected side chains or the N- or C-terminal residues of the PRO.
Derivatization with bifunctional agents is useful, for instance, for crosslinking PRO to a water-insoluble
support matrix or surface for use in the method for purifying anti-PRO antibodies, and vice-versa. Commonly
used crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde,
N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters,
including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such
as bis-N-maleimido-1,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl
groups of seryl or threonyl residues, methylation of the α-amino groups of lysine, arginine, and histidine side
chains [T.E. Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco,
pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group.

Another type of covalent modification of the PRO polypeptide included within the scope of this
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native
glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties
found in native sequence PRO (either by removing the underlying glycosylation site or by deleting the
glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are not
present in the native sequence PRO. In addition, the phrase includes qualitative changes in the glycosylation 
of the native proteins, involving a change in the nature and proportions of the various carbohydrate moieties 
present. 

Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino 
acid sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more
serine or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino
acid sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 
encoding the PRO polypeptide at preselected bases such that codons are generated that will translate into the 
desired amino acids. 
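
Mutating the DNA at preselected bases so that a codon translates into a desired amino acid can be sketched as shown below. This is an illustration under assumptions: it uses the standard genetic code, picks the candidate codon requiring the fewest base changes (an arbitrary rule chosen for the example), and the function names are hypothetical. Codon usage of the intended host is not considered.

```python
# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def substitute_residue(dna: str, position: int, new_aa: str) -> str:
    """Return dna with the codon at 1-based residue position changed to encode new_aa.

    Among the codons for new_aa, the one differing from the original
    codon at the fewest bases is used (a choice made for this sketch).
    """
    dna = dna.upper()
    start = 3 * (position - 1)
    old_codon = dna[start:start + 3]
    if len(old_codon) != 3:
        raise ValueError("position lies outside the coding sequence")
    candidates = [codon for codon, aa in CODON_TABLE.items() if aa == new_aa]
    if not candidates:
        raise ValueError(f"no codon encodes {new_aa!r}")
    best = min(candidates, key=lambda c: sum(x != y for x, y in zip(c, old_codon)))
    return dna[:start] + best + dna[start + 3:]


if __name__ == "__main__":
    mutated = substitute_residue("ATGGCTAAA", 2, "S")  # Ala -> Ser at residue 2
    print(mutated, translate(mutated))
```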

Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g.,
in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259-306
(1981).

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically 
or enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets
for glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by
Hakimuddin, et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al., Anal. Biochem., 118:131
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety
of endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzymol., 138:350 (1987).

Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a
variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or
polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417;
4,791,192 or 4,179,337.

The PRO of the present invention may also be modified in a way to form a chimeric molecule 
comprising PRO fused to another, heterologous polypeptide or amino acid sequence.

In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag polypeptide 
which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally 
placed at the amino- or carboxyl- terminus of the PRO. The presence of such epitope-tagged forms of the PRO 
can be detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the 
PRO to be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix
that binds to the epitope tag. Various tag polypeptides and their respective antibodies are well known in the
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag
polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol., 8:2159-2165 (1988)]; the c-myc tag and
the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular Biology,
5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al.,
Protein Engineering, 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al.,
BioTechnology, 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al., Science, 255:192-194 (1992)];
an α-tubulin epitope peptide [Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)]; and the T7 gene 10
protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA, 87:6393-6397 (1990)].
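
Placement of an epitope tag at the amino- or carboxyl-terminus can be illustrated at the sequence level as follows. The tag sequences shown are commonly used forms supplied as assumptions for this sketch and are not recited in the present description; the function name and the terminus argument are likewise hypothetical. Any signal sequence or linker that a real construct might need is ignored.

```python
# Commonly used tag sequences (assumptions of this sketch, not taken from the text).
TAGS = {
    "poly-his": "HHHHHH",
    "flu-HA": "YPYDVPDYA",
    "c-myc": "EQKLISEEDL",
    "Flag": "DYKDDDDK",
}


def add_epitope_tag(protein: str, tag: str, terminus: str = "C") -> str:
    """Return the amino acid sequence of a tag fusion.

    terminus is "N" for an amino-terminal tag or "C" for a
    carboxyl-terminal tag.
    """
    tag_seq = TAGS[tag]
    if terminus == "N":
        return tag_seq + protein
    if terminus == "C":
        return protein + tag_seq
    raise ValueError("terminus must be 'N' or 'C'")


if __name__ == "__main__":
    print(add_epitope_tag("MKT", "poly-his", "C"))
    print(add_epitope_tag("MKT", "Flag", "N"))
```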

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule 
(also referred to as an "immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The 
Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form
of a PRO polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred
embodiment, the immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CH1, CH2 and CH3
regions of an IgG1 molecule. For the production of immunoglobulin fusions see also US Patent No. 5,428,130
issued June 27, 1995.

D. Preparation of PRO 

The description below relates primarily to production of PRO by culturing cells transformed or 
transfected with a vector containing PRO nucleic acid. It is, of course, contemplated that alternative methods, 
which are well known in the art, may be employed to prepare PRO. For instance, the PRO sequence, or
portions thereof, may be produced by direct peptide synthesis using solid-phase techniques [see, e.g., Stewart
et al., Solid-Phase Peptide Synthesis, W.H. Freeman Co., San Francisco, CA (1969); Merrifield, J. Am.
Chem. Soc., 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or
by automation. Automated synthesis may be accomplished, for instance, using an Applied Biosystems Peptide
Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the PRO may be
chemically synthesized separately and combined using chemical or enzymatic methods to produce the
full-length PRO.

1. Isolation of DNA Encoding PRO 
DNA encoding PRO may be obtained from a cDNA library prepared from tissue believed to possess 
the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be conveniently
obtained from a cDNA library prepared from human tissue, such as described in the Examples. The
PRO-encoding gene may also be obtained from a genomic library or by known synthetic procedures (e.g., automated
nucleic acid synthesis).

Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at least 
about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA
or genomic library with the selected probe may be conducted using standard procedures, such as described in
Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press,
1989). An alternative means to isolate the gene encoding PRO is to use PCR methodology [Sambrook et al.,
supra; Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)].

The Examples below describe techniques for screening a cDNA library. The oligonucleotide
sequences selected as probes should be of sufficient length and sufficiently unambiguous that false positives
are minimized. The oligonucleotide is preferably labeled such that it can be detected upon hybridization to
DNA in the library being screened. Methods of labeling are well known in the art, and include the use of
radiolabels like 32P-labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including
moderate stringency and high stringency, are provided in Sambrook et al., supra.
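
The requirement that a probe be long enough and unambiguous enough to minimize false positives can be approximated in silico before any hybridization is attempted. The sketch below is a deliberately simple stand-in: it checks the 20-80 base length range mentioned above and counts exact matches (on either strand) in a collection of sequences. The function names and the exact-match criterion are assumptions; real hybridization tolerates mismatches depending on stringency, which this sketch does not model.

```python
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def count_hits(probe: str, library) -> int:
    """Count library sequences containing the probe on either strand."""
    probe = probe.upper()
    rc = reverse_complement(probe)
    return sum(1 for seq in library if probe in seq.upper() or rc in seq.upper())


def probe_ok(probe: str, library, min_len: int = 20, max_len: int = 80) -> bool:
    """Return True if the probe length is in range and it matches exactly one sequence."""
    return min_len <= len(probe) <= max_len and count_hits(probe, library) == 1


if __name__ == "__main__":
    library = [  # made-up sequences standing in for library clones
        "AAAACCCGGGTTTACGTACGTAGCTAGCTAGGATCC",
        "TTTTGGGGCCCCAAAATTTTGGGGCCCCAAAA",
    ]
    print(probe_ok("ACGTACGTAGCTAGCTAGGA", library))
```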

Sequences identified in such library screening methods can be compared and aligned to other known 
sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across
the full-length sequence can be determined using methods known in the art and as described herein. 
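
As a rough illustration of a sequence identity calculation, the following sketch computes percent identity over two sequences that have already been aligned (for example by an alignment program). The convention used, identical non-gap columns divided by all aligned columns, is only one of several in use and is an assumption of this example; it is not asserted to be the definition applied elsewhere in this description. The function name and gap character are likewise hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Return percent identity between two pre-aligned sequences.

    Both inputs must have the same length. Columns in which both
    sequences carry a gap are ignored; a column counts as identical
    only when both residues match and neither is a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    print(round(percent_identity("ACDE-F", "ACDQGF"), 1))
```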

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic
libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using
conventional primer extension procedures as described in Sambrook et al., supra, to detect precursors and
processing intermediates of mRNA that may not have been reverse-transcribed into cDNA.

2. Selection and Transformation of Host Cells
Host cells are transfected or transformed with expression or cloning vectors described herein for PRO
production and cultured in conventional nutrient media modified as appropriate for inducing promoters,
selecting transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such
as media, temperature, pH and the like, can be selected by the skilled artisan without undue experimentation.
In general, principles, protocols, and practical techniques for maximizing the productivity of cell cultures can
be found in Mammalian Cell Biotechnology: a Practical Approach, M. Butler, ed. (IRL Press, 1991) and
Sambrook et al., supra.

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily
skilled artisan, for example, CaCl2, CaPO4, liposome-mediated and electroporation. Depending on the host
cell used, transformation is performed using standard techniques appropriate to such cells. The calcium
treatment employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally
used for prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of certain plant
cells, as described by Shaw et al., Gene, 23:315 (1983) and WO 89/05859 published 29 June 1989. For
mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham and van der
Eb, Virology, 52:456-457 (1978) can be employed. General aspects of mammalian cell host system
transfections have been described in U.S. Patent No. 4,399,216. Transformations into yeast are typically
carried out according to the method of Van Solingen et al., J. Bact., 130:946 (1977) and Hsiao et al., Proc.
Natl. Acad. Sci. (USA), 76:3829 (1979). However, other methods for introducing DNA into cells, such as
by nuclear microinjection, electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g.,
polybrene, polyornithine, may also be used. For various techniques for transforming mammalian cells, see
Keown et al., Methods in Enzymology, 185:527-537 (1990) and Mansour et al., Nature, 336:348-352 (1988).

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast,
or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as
Gram-negative or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains
are publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537);
E. coli strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells
include Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus,
Salmonella, e.g., Salmonella typhimurium, Serratia, e.g., Serratia marcescens, and Shigella, as well as Bacilli
such as B. subtilis and B. licheniformis (e.g., B. licheniformis 41P disclosed in DD 266,710 published 12 April
1989), Pseudomonas such as P. aeruginosa, and Streptomyces. These examples are illustrative rather than
limiting. Strain W3110 is one particularly preferred host or parent host because it is a common host strain for
recombinant DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic
enzymes. For example, strain W3110 may be modified to effect a genetic mutation in the genes encoding
proteins endogenous to the host, with examples of such hosts including E. coli W3110 strain 1A2, which has
the complete genotype tonA; E. coli W3110 strain 9E4, which has the complete genotype tonA ptr3; E. coli
W3110 strain 27C7 (ATCC 55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169
degP ompT kanr; E. coli W3110 strain 37D6, which has the complete genotype tonA ptr3 phoA E15
(argF-lac)169 degP ompT rbs7 ilvG kanr; E. coli W3110 strain 40B4, which is strain 37D6 with a non-kanamycin
resistant degP deletion mutation; and an E. coli strain having mutant periplasmic protease disclosed in U.S.
Patent No. 4,946,783 issued 7 August 1990. Alternatively, in vitro methods of cloning, e.g., PCR or other
nucleic acid polymerase reactions, are suitable.

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning
or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower
eukaryotic host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature,
290:140 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et
al., Bio/Technology, 9:968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt
et al., J. Bacteriol., 154(2):737-742 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K.
wickeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et
al., Bio/Technology, 8:135 (1990)), K. thermotolerans, and K. marxianus; Yarrowia (EP 402,226); Pichia
pastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol., 28:265-278 [1988]); Candida; Trichoderma
reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA, 76:5259-5263 [1979]);
Schwanniomyces such as Schwanniomyces occidentalis (EP 394,538 published 31 October 1990); and
filamentous fungi such as, e.g., Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 January
1991), and Aspergillus hosts such as A. nidulans (Ballance et al., Biochem. Biophys. Res. Commun.,
112:284-289 [1983]; Tilburn et al., Gene, 26:205-221 [1983]; Yelton et al., Proc. Natl. Acad. Sci. USA,
81:1470-1474 [1984]) and A. niger (Kelly and Hynes, EMBO J., 4:475-479 [1985]). Methylotrophic yeasts are suitable
herein and include, but are not limited to, yeast capable of growth on methanol selected from the genera
consisting of Hansenula, Candida, Kloeckera, Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list
of specific species that are exemplary of this class of yeasts may be found in C. Anthony, The Biochemistry
of Methylotrophs, 269 (1982).

Suitable host cells for the expression of glycosylated PRO are derived from multicellular organisms.
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant
cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells.
More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651);
human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al.,
J. Gen Virol., 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. Natl.
Acad. Sci. USA, 77:4216 (1980)); mouse sertoli cells (TM4, Mather, Biol. Reprod., 23:243-251 (1980));
human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary tumor
(MMT 060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill in
the art.



3. Selection and Use of a Replicable Vector

The nucleic acid (e.g., cDNA or genomic DNA) encoding PRO may be inserted into a replicable 
vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The 
vector may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate 
nucleic acid sequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted 
into an appropriate restriction endonuclease site(s) using techniques known in the art. Vector components 
generally include, but are not limited to, one or more of a signal sequence, an origin of replication, one or 
more marker genes, an enhancer element, a promoter, and a transcription termination sequence. Construction 
of suitable vectors containing one or more of these components employs standard ligation techniques which 
are known to the skilled artisan. 

The PRO may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component
of the vector, or it may be a part of the PRO-encoding DNA that is inserted into the vector. The signal
sequence may be a prokaryotic signal sequence selected, for example, from the group of the alkaline
phosphatase, penicillinase, lpp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence
may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces
α-factor leaders, the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C. albicans
glucoamylase leader (EP 362,179 published 4 April 1990), or the signal described in WO 90/13646 published
15 November 1990. In mammalian cell expression, mammalian signal sequences may be used to direct 
secretion of the protein, such as signal sequences from secreted polypeptides of the same or related species, 
as well as viral secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 
in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2μ plasmid
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful 
for cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 
Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g. , ampicillin, 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical
nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli.

Examples of suitable selectable markers for mammalian cells are those that enable the identification
of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An
appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity,
prepared and propagated as described by Urlaub et al., Proc. Natl. Acad. Sci. USA, 77:4216 (1980). A
suitable selection gene for use in yeast is the trp1 gene present in the yeast plasmid YRp7 [Stinchcomb et al.,
Nature, 282:39 (1979); Kingsman et al., Gene, 7:141 (1979); Tschemper et al., Gene, 10:157 (1980)]. The
trp1 gene provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan,
for example, ATCC No. 44076 or PEP4-1 [Jones, Genetics, 85:12 (1977)].

Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding
nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are
well known. Promoters suitable for use with prokaryotic hosts include the β-lactamase and lactose promoter
systems [Chang et al., Nature, 275:615 (1978); Goeddel et al., Nature, 281:544 (1979)], alkaline phosphatase,
a tryptophan (trp) promoter system [Goeddel, Nucleic Acids Res., 8:4057 (1980); EP 36,776], and hybrid
promoters such as the tac promoter [deBoer et al., Proc. Natl. Acad. Sci. USA, 80:21-25 (1983)]. Promoters
for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably linked to the DNA
encoding PRO.

Examples of suitable promoting sequences for use with yeast hosts include the promoters for
3-phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem., 255:2073 (1980)] or other glycolytic enzymes [Hess
et al., J. Adv. Enzyme Reg., 7:149 (1968); Holland, Biochemistry, 17:4900 (1978)], such as enolase,
glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase,
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase,
phosphoglucose isomerase, and glucokinase.

Other yeast promoters, which are inducible promoters having the additional advantage of transcription
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid
phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein,
glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization.
Suitable vectors and promoters for use in yeast expression are further described in EP 73,657.

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters
obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 July
1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a
retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the
actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are
compatible with the host cell systems.

Transcription of a DNA encoding the PRO by higher eukaryotes may be increased by inserting an
enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually from about 10 to 300
bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from
mammalian genes (globin, elastase, albumin, α-fetoprotein, and insulin). Typically, however, one will use
an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the
replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the
late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at
a position 5' or 3' to the PRO coding sequence, but is preferably located at a site 5' from the promoter.

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or
nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination
of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and,
occasionally, 3' untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide
segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding PRO.

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PRO in
recombinant vertebrate cell culture are described in Gething et al., Nature, 293:620-625 (1981); Mantei et al.,
Nature, 281:40-46 (1979); EP 117,060; and EP 117,058.

4. Detecting Gene Amplification/Expression 
Gene amplification and/or expression may be measured in a sample directly, for example, by
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl.
Acad. Sci. USA, 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be
employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA
hybrid duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried
out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence
of antibody bound to the duplex can be detected.

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently,
the antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based 
on the DNA sequences provided herein or against exogenous sequence fused to PRO DNA and encoding a 
specific antibody epitope. 

5. Purification of Polypeptide

Forms of PRO may be recovered from culture medium or from host cell lysates. If membrane-bound,
it can be released from the membrane using a suitable detergent solution (e.g., Triton-X 100) or by enzymatic
cleavage. Cells employed in expression of PRO can be disrupted by various physical or chemical means, such
as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing agents.

It may be desired to purify PRO from recombinant cell proteins or polypeptides. The following
procedures are exemplary of suitable purification procedures: fractionation on an ion-exchange column;
ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as
DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example,
Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal chelating
columns to bind epitope-tagged forms of the PRO. Various methods of protein purification may be employed
and such methods are known in the art and described, for example, in Deutscher, Methods in Enzymology, 182
(1990); Scopes, Protein Purification: Principles and Practice, Springer-Verlag, New York (1982). The
purification step(s) selected will depend, for example, on the nature of the production process used and the
particular PRO produced.

Nucleotide sequences (or their complement) encoding PRO have various applications in the art of
molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the
generation of anti-sense RNA and DNA. PRO nucleic acid will also be useful for the preparation of PRO 
polypeptides by the recombinant techniques described herein. 

The full-length native sequence PRO gene, or portions thereof, may be used as hybridization probes
for a cDNA library to isolate the full-length PRO cDNA or to isolate still other cDNAs (for instance, those
encoding naturally-occurring variants of PRO or PRO from other species) which have a desired sequence
identity to the native PRO sequence disclosed herein. Optionally, the length of the probes will be about 20 to
about 50 bases. The hybridization probes may be derived from at least partially novel regions of the full length
native nucleotide sequence wherein those regions may be determined without undue experimentation or from
genomic sequences including promoters, enhancer elements and introns of native sequence PRO. By way of
example, a screening method will comprise isolating the coding region of the PRO gene using the known DNA
sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety
of labels, including radionucleotides such as ³²P or ³⁵S, or enzymatic labels such as alkaline phosphatase
coupled to the probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary
to that of the PRO gene of the present invention can be used to screen libraries of human cDNA, genomic
DNA or mRNA to determine which members of such libraries the probe hybridizes to. Hybridization
techniques are described in further detail in the Examples below.
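
As a purely illustrative sketch of the probe-selection step described above, the following Python fragment enumerates candidate probes of about 40 bases that are complementary to windows of a coding sequence. The function names, the step size, the GC-content helper and the example sequence are all hypothetical and are not taken from this disclosure; an actual probe design would also weigh melting temperature, repeats and cross-hybridization.

```python
# Illustrative sketch only: names, step size and example sequence are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def candidate_probes(coding_seq: str, length: int = 40, step: int = 10):
    """Yield (start, probe) pairs; each probe is complementary to a
    `length`-base window of the coding sequence starting at `start`."""
    seq = coding_seq.upper()
    for start in range(0, len(seq) - length + 1, step):
        yield start, reverse_complement(seq[start:start + length])


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, a common rough filter for probe candidates."""
    return sum(base in "GC" for base in seq.upper()) / len(seq)


# Made-up 60-base sequence, used only to exercise the functions.
example_coding_seq = "ATGGCCAAGT" * 6
probes = list(candidate_probes(example_coding_seq))
for start, probe in probes:
    print(start, probe, round(gc_fraction(probe), 2))
```

The window length defaults to 40 bases to mirror the "about 40 bases" example in the text; any value in the stated range of about 20 to about 50 bases could be passed instead.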

Any EST sequences disclosed in the present application may similarly be employed as probes, using 
the methods disclosed herein. 

Other useful fragments of the PRO nucleic acids include antisense or sense oligonucleotides
comprising a single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target PRO
mRNA (sense) or PRO DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the
present invention, comprise a fragment of the coding region of PRO DNA. Such a fragment generally
comprises at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein is described in,
for example, Stein and Cohen (Cancer Res. 48:2659, 1988) and van der Krol et al. (BioTechniques 6:958,
1988).

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation
of duplexes that block transcription or translation of the target sequence by one of several means, including 
enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. 
The antisense oligonucleotides thus may be used to block expression of PRO proteins. Antisense or sense 
oligonucleotides further comprise oligonucleotides having modified sugar-phosphodiester backbones (or other 
sugar linkages, such as those described in WO 91/06629) and wherein such sugar linkages are resistant to 
endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of 
resisting enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide 
sequences. 
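
The derivation of an antisense oligonucleotide from a known sequence can be sketched in a few lines of Python. The sketch below assumes an RNA target written 5' to 3' and simply enumerates every fragment of about 14 to 30 nucleotides together with its antiparallel complement; the function names and the 20-nucleotide example target are hypothetical, and no claim is made that any enumerated oligonucleotide would be effective in practice.

```python
# Illustrative sketch only: function names and the example target are hypothetical.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(target: str) -> str:
    """Return the antiparallel complement (5'->3') of an RNA target sequence."""
    return target.upper().translate(RNA_COMPLEMENT)[::-1]


def antisense_candidates(mrna: str, min_len: int = 14, max_len: int = 30):
    """List (start, length, oligo) for every window of min_len..max_len nucleotides.

    Windows longer than the target are skipped automatically because the
    inner range is then empty.
    """
    seq = mrna.upper()
    candidates = []
    for length in range(min_len, max_len + 1):
        for start in range(len(seq) - length + 1):
            window = seq[start:start + length]
            candidates.append((start, length, antisense_rna(window)))
    return candidates


# Made-up 20-nucleotide target, used only to exercise the functions.
example_target = "AUGGCUAAGCUGGAUCCGUA"
oligos = antisense_candidates(example_target)
print(len(oligos), oligos[0])
```

The length bounds default to the "about 14 to 30 nucleotides" range stated above and can be changed by the caller.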

Other examples of sense or antisense oligonucleotides include those oligonucleotides which are 
covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that 
increase the affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still,
intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or 
antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target 
nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid 
sequence by any gene transfer method, including, for example, CaPO4-mediated DNA transfection,
electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an
antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target
nucleic acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable
retroviral vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a
retrovirus derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see
WO 90/13641). 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide 
sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines,
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule 
or receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the 
target nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. 
The sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous 
lipase. 

Antisense or sense RNA or DNA molecules are generally at least about 5 bases in length, about 10 
bases in length, about 15 bases in length, about 20 bases in length, about 25 bases in length, about 30 bases 
in length, about 35 bases in length, about 40 bases in length, about 45 bases in length, about 50 bases in length, 
about 55 bases in length, about 60 bases in length, about 65 bases in length, about 70 bases in length, about 
75 bases in length, about 80 bases in length, about 85 bases in length, about 90 bases in length, about 95 bases
in length, about 100 bases in length, or more.

The probes may also be employed in PCR techniques to generate a pool of sequences for identification
of closely related PRO coding sequences. 

Nucleotide sequences encoding a PRO can also be used to construct hybridization probes for mapping 
the gene which encodes that PRO and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, 
and hybridization screening with libraries. 

When the coding sequences for PRO encode a protein which binds to another protein (for example, where
the PRO is a receptor), the PRO can be used in assays to identify the other proteins or molecules involved in
the binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be
identified. Proteins involved in such binding interactions can also be used to screen for peptide or small
molecule inhibitors or agonists of the binding interaction. Also, the receptor PRO can be used to isolate
correlative ligand(s). Screening assays can be designed to find lead compounds that mimic the biological
activity of a native PRO or a receptor for PRO. Such screening assays will include assays amenable to
high-throughput screening of chemical libraries, making them particularly suitable for identifying small molecule
drug candidates. Small molecules contemplated include synthetic organic or inorganic compounds. The assays
can be performed in a variety of formats, including protein-protein binding assays, biochemical screening
assays, immunoassays and cell based assays, which are well characterized in the art.

Nucleic acids which encode PRO or its modified forms can also be used to generate either transgenic
animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically
useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene,
which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic
stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal
develops. In one embodiment, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in
accordance with established techniques and the genomic sequences used to generate transgenic animals that
contain cells which express DNA encoding PRO. Methods for generating transgenic animals, particularly
animals such as mice or rats, have become conventional in the art and are described, for example, in U.S.
Patent Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for PRO transgene
incorporation with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding
PRO introduced into the germ line of the animal at an embryonic stage can be used to examine the effect of
increased expression of DNA encoding PRO. Such animals can be used as tester animals for reagents thought
to confer protection from, for example, pathological conditions associated with its overexpression. In
accordance with this facet of the invention, an animal is treated with the reagent and a reduced incidence of
the pathological condition, compared to untreated animals bearing the transgene, would indicate a potential
therapeutic intervention for the pathological condition.

Alternatively, non-human homologues of PRO can be used to construct a PRO "knock out" animal
which has a defective or altered gene encoding PRO as a result of homologous recombination between the
endogenous gene encoding PRO and altered genomic DNA encoding PRO introduced into an embryonic stem
cell of the animal. For example, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in
accordance with established techniques. A portion of the genomic DNA encoding PRO can be deleted or
replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor
integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included
in the vector [see e.g., Thomas and Capecchi, Cell, 51:503 (1987) for a description of homologous
recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g., by electroporation)
and cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected
[see e.g., Li et al., Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal
(e.g., a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic
Stem Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152]. A chimeric
embryo can then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to
term to create a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ
cells can be identified by standard techniques and used to breed animals in which all cells of the animal contain
the homologously recombined DNA. Knockout animals can be characterized, for instance, for their ability to
defend against certain pathological conditions and for their development of pathological conditions due to
absence of the PRO polypeptide.

Nucleic acid encoding the PRO polypeptides may also be used in gene therapy. In gene therapy
applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective
genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional
gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic
agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA.
Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act
as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell
membrane. (Zamecnik et al., Proc. Natl. Acad. Sci. USA 83:4143-4146 [1986]). The oligonucleotides can
be modified to enhance their uptake, e.g., by substituting their negatively charged phosphodiester groups by
uncharged groups.

There are a variety of techniques available for introducing nucleic acids into viable cells. The
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo
in the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in
vitro include the use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium
phosphate precipitation method, etc. The currently preferred in vivo gene transfer techniques include
transfection with viral (typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau
et al., Trends in Biotechnology 11, 205-210 [1993]). In some situations it is desirable to provide the nucleic
acid source with an agent that targets the target cells, such as an antibody specific for a cell surface membrane
protein or the target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed,
proteins which bind to a cell surface membrane protein associated with endocytosis may be used for targeting
and/or to facilitate uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies
for proteins which undergo internalization in cycling, proteins that target intracellular localization and enhance
intracellular half-life. The technique of receptor-mediated endocytosis is described, for example, by Wu et
al., J. Biol. Chem. 262, 4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414
(1990). For review of gene marking and gene therapy protocols see Anderson et al., Science 256, 808-813
(1992).

The PRO polypeptides described herein may also be employed as molecular weight markers for 
protein electrophoresis purposes and the isolated nucleic acid sequences may be used for recombinantly 
expressing those markers. 

The nucleic acid molecules encoding the PRO polypeptides or fragments thereof described herein are 
useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 
markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently 
available. Each PRO nucleic acid molecule of the present invention can be used as a chromosome marker. 

The PRO polypeptides and nucleic acid molecules of the present invention may also be used for tissue 
typing, wherein the PRO polypeptides of the present invention may be differentially expressed in one tissue 
as compared to another. PRO nucleic acid molecules will find use for generating probes for PCR, Northern
analysis, Southern analysis and Western analysis.

The PRO polypeptides described herein may also be employed as therapeutic agents. The PRO 
polypeptides of the present invention can be formulated according to known methods to prepare 
pharmaceutically useful compositions, whereby the PRO product hereof is combined in admixture with a 
pharmaceutically acceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing the
active ingredient having the desired degree of purity with optional physiologically acceptable carriers,
excipients or stabilizers (Remington's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form
of lyophilized formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic
to recipients at the dosages and concentrations employed, and include buffers such as phosphate, citrate and
other organic acids; antioxidants including ascorbic acid; low molecular weight (less than about 10 residues)
polypeptides; proteins, such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as
polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides,
disaccharides and other carbohydrates including glucose, mannose, or dextrins; chelating agents such as
EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic
surfactants such as TWEEN™, PLURONICS™ or PEG.

The formulations to be used for in vivo administration must be sterile. This is readily accomplished 
by filtration through sterile filtration membranes, prior to or following lyophilization and reconstitution. 

Therapeutic compositions herein generally are placed into a container having a sterile access port, for
example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g., injection or infusion by
intravenous, intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or intralesional routes,
topical administration, or by sustained release systems.

Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may
vary depending on the particular use envisioned. The determination of the appropriate dosage or route of
administration is well within the skill of an ordinary physician. Animal experiments provide reliable guidance
for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be
performed following the principles laid down by Mordenti, J. and Chappell, W. "The use of interspecies
scaling in toxicokinetics" In Toxicokinetics and New Drug Development, Yacobi et al., Eds., Pergamon Press,
New York 1989, pp. 42-96.

When in vivo administration of a PRO polypeptide or agonist or antagonist thereof is employed,
normal dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more
per day, preferably about 1 μg/kg/day to 10 mg/kg/day, depending upon the route of administration. Guidance
as to particular dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos.
4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different
treatment compounds and different disorders, that administration targeting one organ or tissue, for example,
may necessitate delivery in a manner different from that to another organ or tissue.
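
The per-kilogram ranges recited above span several orders of magnitude, so the unit arithmetic is worth making explicit. The Python sketch below converts the stated ranges into total daily amounts for a given body weight, working in whole nanograms to avoid rounding. It is a unit-conversion illustration only: the function and constant names and the 70 kg example weight are hypothetical, the upper bound of the broad range is recited as "or more" and so is not a hard limit, and nothing here substitutes for the determination of dosage by a physician.

```python
# Illustrative unit arithmetic only: names and the example weight are hypothetical.
NG_PER_UG = 1_000
NG_PER_MG = 1_000_000

# Ranges as recited in the text, expressed in ng per kg of body weight per day.
BROAD_RANGE_NG_PER_KG = (10, 100 * NG_PER_MG)                # about 10 ng/kg to 100 mg/kg
PREFERRED_RANGE_NG_PER_KG = (1 * NG_PER_UG, 10 * NG_PER_MG)  # about 1 ug/kg to 10 mg/kg


def daily_amount_ng(range_ng_per_kg, body_weight_kg):
    """Scale a (low, high) ng/kg/day range to total ng/day for one body weight."""
    low, high = range_ng_per_kg
    return low * body_weight_kg, high * body_weight_kg


def as_mg(amount_ng):
    """Convert nanograms to milligrams."""
    return amount_ng / NG_PER_MG


example_weight_kg = 70  # hypothetical body weight
low_ng, high_ng = daily_amount_ng(PREFERRED_RANGE_NG_PER_KG, example_weight_kg)
print(f"preferred range: {as_mg(low_ng)} mg/day to {as_mg(high_ng)} mg/day")
```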

Where sustained-release administration of a PRO polypeptide is desired in a formulation with release
characteristics suitable for the treatment of any disease or disorder requiring administration of the PRO
polypeptide, microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant
proteins for sustained release has been successfully performed with human growth hormone (rhGH), interferon-
(rhIFN- ), interleukin-2, and MN rgp120. Johnson et al., Nat. Med., 2:795-799 (1996); Yasuda, Biomed.
Ther., 27:1221-1223 (1993); Hora et al., Bio/Technology, 8:755-758 (1990); Cleland, "Design and Production
of Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in Vaccine Design:
The Subunit and Adjuvant Approach, Powell and Newman, eds. (Plenum Press: New York, 1995), pp. 439-
462; WO 97/03692, WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010.

The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid
(PLGA) polymer due to its biocompatibility and wide range of biodegradable properties. The degradation
products of PLGA, lactic and glycolic acids, can be cleared quickly within the human body. Moreover, the
degradability of this polymer can be adjusted from months to years depending on its molecular weight and
composition. Lewis, "Controlled release of bioactive agents from lactide/glycolide polymer," in: M. Chasin
and R. Langer (Eds.), Biodegradable Polymers as Drug Delivery Systems (Marcel Dekker: New York, 1990),
pp. 1-41.

This invention encompasses methods of screening compounds to identify those that mimic the PRO
polypeptide (agonists) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for
antagonist drug candidates are designed to identify compounds that bind or complex with the PRO polypeptides
encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides
with other cellular proteins. Such screening assays will include assays amenable to high-throughput screening
of chemical libraries, making them particularly suitable for identifying small molecule drug candidates.

The assays can be performed in a variety of formats, including protein-protein binding assays,
biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art.

All assays for antagonists are common in that they call for contacting the drug candidate with a PRO
polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these
two components to interact.

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified herein or
the drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution
of the PRO polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody,
specific for the PRO polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is
performed by adding the non-immobilized component, which may be labeled by a detectable label, to the
immobilized component, e.g., the coated surface containing the anchored component. When the reaction is
complete, the non-reacted components are removed, e.g., by washing, and complexes anchored on the solid
surface are detected. When the originally non-immobilized component carries a detectable label, the detection
of label immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized
component does not carry a label, complexing can be detected, for example, by using a labeled antibody 
specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular PRO polypeptide encoded 
by a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for 
detecting protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking,
co-immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition,
protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields and
co-workers (Fields and Song, Nature (London), 340:245-246 (1989); Chien et al., Proc. Natl. Acad. Sci.
USA, 88:9578-9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA, 89:5789-
5793 (1991). Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular
domains, one acting as the DNA-binding domain, the other one functioning as the transcription-activation
domain. The yeast expression system described in the foregoing publications (generally referred to as the
"two-hybrid system") takes advantage of this property, and employs two hybrid proteins, one in which the
target protein is fused to the DNA-binding domain of GAL4, and another, in which candidate activating
proteins are fused to the activation domain. The expression of a GAL1-lacZ reporter gene under control of
a GAL4-activated promoter depends on reconstitution of GAL4 activity via protein-protein interaction.
Colonies containing interacting polypeptides are detected with a chromogenic substrate for β-galactosidase.
A complete kit (MATCHMAKER™) for identifying protein-protein interactions between two specific proteins 
using the two-hybrid technique is commercially available from Clontech. This system can also be extended 
to map protein domains involved in specific protein interactions as well as to pinpoint amino acid residues that 
are crucial for these interactions. 

Compounds that interfere with the interaction of a gene encoding a PRO polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
containing the product of the gene and the intra- or extracellular component under conditions and for a time 
allowing for the interaction and binding of the two products. To test the ability of a candidate compound to
inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a
placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex
formation) between the test compound and the intra- or extracellular component present in the mixture is
monitored as described hereinabove. The formation of a complex in the control reaction(s) but not in the
reaction mixture containing the test compound indicates that the test compound interferes with the interaction
of the test compound and its reaction partner.

To assay for antagonists, the PRO polypeptide may be added to a cell along with the compound to be
screened for a particular activity and the ability of the compound to inhibit the activity of interest in the
presence of the PRO polypeptide indicates that the compound is an antagonist to the PRO polypeptide.
Alternatively, antagonists may be detected by combining the PRO polypeptide and a potential antagonist with
membrane-bound PRO polypeptide receptors or recombinant receptors under appropriate conditions for a
competitive inhibition assay. The PRO polypeptide can be labeled, such as by radioactivity, such that the
number of PRO polypeptide molecules bound to the receptor can be used to determine the effectiveness of the
potential antagonist. The gene encoding the receptor can be identified by numerous methods known to those
of skill in the art, for example, ligand panning and FACS sorting. Coligan et al., Current Protocols in
Immun., 1(2): Chapter 5 (1991). Preferably, expression cloning is employed wherein polyadenylated RNA
is prepared from a cell responsive to the PRO polypeptide and a cDNA library created from this RNA is
divided into pools and used to transfect COS cells or other cells that are not responsive to the PRO polypeptide.
Transfected cells that are grown on glass slides are exposed to labeled PRO polypeptide. The PRO polypeptide
can be labeled by a variety of means including iodination or inclusion of a recognition site for a site-specific
protein kinase. Following fixation and incubation, the slides are subjected to autoradiographic analysis.
Positive pools are identified and sub-pools are prepared and re-transfected using an iterative sub-pooling and
re-screening process, eventually yielding a single clone that encodes the putative receptor.
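
The sub-pooling and re-screening logic described above can be illustrated with a minimal sketch. It assumes that exactly one clone in the starting pool is positive and that a single screening call (here the hypothetical callback `pool_is_positive`, standing in for transfection, exposure to labeled PRO polypeptide, and autoradiography) reports whether a pool contains it. The function name, the callback, and the default of four sub-pools per round are illustrative assumptions, not part of the disclosed procedure.

```python
from typing import Callable, List, Sequence


def find_positive_clone(
    clones: Sequence[str],
    pool_is_positive: Callable[[Sequence[str]], bool],
    n_subpools: int = 4,
) -> str:
    """Narrow a positive pool down to a single clone by repeated sub-pooling.

    `pool_is_positive` is a hypothetical stand-in for one round of
    transfection and autoradiographic screening of a pool.
    """
    if not clones:
        raise ValueError("the starting pool is empty")
    if n_subpools < 2:
        raise ValueError("at least two sub-pools are needed per round")

    pool: List[str] = list(clones)
    while len(pool) > 1:
        size = -(-len(pool) // n_subpools)  # ceiling division
        subpools = [pool[i:i + size] for i in range(0, len(pool), size)]
        for sub in subpools:
            if pool_is_positive(sub):
                pool = sub  # keep only the positive sub-pool and re-screen it
                break
        else:
            raise ValueError("no sub-pool scored positive")
    return pool[0]
```

Under these assumptions, the number of screening rounds grows only logarithmically with library size, which is the practical reason for pooling rather than screening every clone individually.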

As an alternative approach for receptor identification, labeled PRO polypeptide can be photoaffinity-linked
with cell membrane or extract preparations that express the receptor molecule. Cross-linked material
is resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised,
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained
from micro-sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA
library to identify the gene encoding the putative receptor.

In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor
would be incubated with labeled PRO polypeptide in the presence of the candidate compound. The ability of
the compound to enhance or block this interaction could then be measured.
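
As one way to quantify such a measurement, the following sketch computes percent inhibition of specific binding from labeled-ligand counts. The function name, the argument names, and the subtraction of a nonspecific-binding background are assumptions made for illustration; the text itself does not prescribe a particular calculation.

```python
def percent_inhibition(
    bound_with_compound: float,
    bound_without_compound: float,
    nonspecific: float = 0.0,
) -> float:
    """Percent inhibition of specific binding of labeled PRO polypeptide.

    All three arguments are signal readings (e.g. counts per minute).
    A negative result indicates that the compound enhanced binding.
    """
    specific_control = bound_without_compound - nonspecific
    if specific_control <= 0:
        raise ValueError("control binding must exceed nonspecific binding")
    specific_test = bound_with_compound - nonspecific
    return 100.0 * (1.0 - specific_test / specific_control)
```

A compound that leaves binding unchanged gives 0, a compound that reduces binding to the nonspecific background gives 100, and an enhancer gives a negative value.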

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions
of immunoglobulin with PRO polypeptide, and, in particular, antibodies including, without limitation, poly-
and monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and
chimeric or humanized versions of such antibodies or fragments, as well as human antibodies and antibody
fragments. Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form
of the PRO polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the
action of the PRO polypeptide.

Another potential PRO polypeptide antagonist is an antisense RNA or DNA construct prepared using
antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation
of mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be
used to control gene expression through triple-helix formation or antisense DNA or RNA, both of which
methods are based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of
the polynucleotide sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense
RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be
complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res.,
6:3073 (1979); Cooney et al., Science, 241:456 (1988); Dervan et al., Science, 251:1360 (1991)), thereby
preventing transcription and the production of the PRO polypeptide. The antisense RNA oligonucleotide
hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the PRO polypeptide
(antisense - Okano, Neurochem., 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene
Expression (CRC Press: Boca Raton, FL, 1988)). The oligonucleotides described above can also be delivered
to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the PRO
polypeptide. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiation
site, e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred.

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or
growth factor or other relevant binding site of the PRO polypeptide, thereby blocking the normal biological
activity of the PRO polypeptide. Examples of small molecules include, but are not limited to, small peptides
or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic
compounds.

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA.
Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by
known techniques. For further details see, e.g., Rossi, Current Biology, 4:469-471 (1994), and PCT
publication No. WO 97/33551 (published September 18, 1997).

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded
and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches
of purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO
97/33551, supra.

These small molecules can be identified by any one or more of the screening assays discussed
hereinabove and/or by any other screening techniques well known for those skilled in the art.

Uses of the herein disclosed molecules may also be based upon the positive functional assay hits
disclosed and described below.

F. Anti-PRO Antibodies
The present invention further provides anti-PRO antibodies. Exemplary antibodies include polyclonal,
monoclonal, humanized, bispecific, and heteroconjugate antibodies.



1. Polyclonal Antibodies 

The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example,
by one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing
agent and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections.
The immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to
conjugate the immunizing agent to a protein known to be immunogenic in the mammal being immunized.
Examples of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be employed
include Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose
dicorynomycolate). The immunization protocol may be selected by one skilled in the art without undue
experimentation.

2. Monoclonal Antibodies 

The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may
be prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature, 256:495
(1975). In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized
with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will
specifically bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro.

The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof.
Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or
spleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are
then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form
a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59-103].
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent,
bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may
be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth
or survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine
guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will
include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of
HGPRT-deficient cells.

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression
of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium.
More preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from
the Salk Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection,
Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described
for the production of human monoclonal antibodies [Kozbor, J. Immunol., 132:3001 (1984); Brodeur et al.,
Monoclonal Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp.
51-63].

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence
of monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies
produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such
as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays
are known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the
Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980).
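
For orientation, the classic Scatchard linearization can be sketched as follows. For a single class of binding sites, bound/free = (Bmax - bound)/Kd, so a straight-line fit of bound/free against bound has slope -1/Kd and x-intercept Bmax. This is a simplified least-squares illustration under that single-site assumption, with hypothetical function and variable names; it is not a reproduction of the cited authors' fitting procedure.

```python
from typing import Sequence, Tuple


def scatchard_fit(bound: Sequence[float], free: Sequence[float]) -> Tuple[float, float]:
    """Estimate (Kd, Bmax) from a linear fit of bound/free versus bound.

    `bound` and `free` are paired concentrations in the same units.
    Assumes a single class of non-interacting binding sites.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired (bound, free) measurements")
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope            # slope = -1/Kd
    bmax = -intercept / slope    # x-intercept of the fitted line
    return kd, bmax
```

With noise-free data generated from bound = Bmax·free/(Kd + free), the fit recovers the Kd and Bmax used to generate it. With real data, nonlinear fitting of the untransformed binding curve is generally less sensitive to measurement error than this linearization.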

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution
procedures and grown by standard methods [Goding, supra]. Suitable culture media for this purpose include,
for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma
cells may be grown in vivo as ascites in a mammal.

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture
medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein
A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography.

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described
in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily
isolated and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable
of binding specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma
cells of the invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into
expression vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary
(CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis
of monoclonal antibodies in the recombinant host cells. The DNA also may be modified, for example, by
substituting the coding sequence for human heavy and light chain constant domains in place of the homologous
murine sequences [U.S. Patent No. 4,816,567; Morrison et al., supra] or by covalently joining to the
immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin polypeptide.
Such a non-immunoglobulin polypeptide can be substituted for the constant domains of an antibody of the
invention, or can be substituted for the variable domains of one antigen-combining site of an antibody of the
invention to create a chimeric bivalent antibody.

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well
known in the art. For example, one method involves recombinant expression of immunoglobulin light chain
and modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent
heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid
residue or are deleted so as to prevent crosslinking.

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to
produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known
in the art.

3. Human and Humanized Antibodies

The anti-PRO antibodies of the invention may further comprise humanized antibodies or human
antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins,
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen-binding
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin.
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a
non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and
capacity. In some instances, Fv framework residues of the human immunoglobulin are replaced by
corresponding non-human residues. Humanized antibodies may also comprise residues which are found neither
in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized
antibody will comprise substantially all of at least one, and typically two, variable domains, in which all or
substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or
substantially all of the FR regions are those of a human immunoglobulin consensus sequence. The humanized
antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically
that of a human immunoglobulin [Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature,
332:323-329 (1988); and Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)].

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized
antibody has one or more amino acid residues introduced into it from a source which is non-human. These
non-human amino acid residues are often referred to as "import" residues, which are typically taken from an
"import" variable domain. Humanization can be essentially performed following the method of Winter and
co-workers [Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988);
Verhoeyen et al., Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric
antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact human variable domain has
been substituted by the corresponding sequence from a non-human species. In practice, humanized antibodies
are typically human antibodies in which some CDR residues and possibly some FR residues are substituted by
residues from analogous sites in rodent antibodies.

Human antibodies can also be produced using various techniques known in the art, including phage
display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); Marks et al., J. Mol. Biol.,
222:581 (1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation of
human monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77
(1985) and Boerner et al., J. Immunol., 147(1):86-95 (1991)). Similarly, human antibodies can be made by
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous
immunoglobulin genes have been partially or completely inactivated. Upon challenge, human antibody
production is observed, which closely resembles that seen in humans in all respects, including gene
rearrangement, assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent
Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific
publications: Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg et al., Nature 368, 856-859 (1994);
Morrison, Nature 368, 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 (1996); Neuberger,
Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. Immunol. 12, 65-93 (1995).

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding
specificities for at least two different antigens. In the present case, one of the binding specificities is for the
PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor
subunit.

Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain
pairs, where the two heavy chains have different specificities [Milstein and Cuello, Nature, 305:537-539
(1983)]. Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas
(quadromas) produce a potential mixture of ten different antibody molecules, of which only one has the correct
bispecific structure. The purification of the correct molecule is usually accomplished by affinity
chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in
Traunecker et al., EMBO J., 10:3655-3659 (1991).
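
The figure of ten possible molecules follows from simple counting, which the sketch below makes explicit. It assumes each antibody consists of two heavy-chain/light-chain arms, that either light chain can pair with either heavy chain, and that the order of the two arms does not matter. The chain labels H1, H2, L1, L2 and the function name are illustrative only.

```python
from itertools import combinations_with_replacement, product
from typing import List, Sequence, Tuple

Arm = Tuple[str, str]


def quadroma_species(heavy: Sequence[str] = ("H1", "H2"),
                     light: Sequence[str] = ("L1", "L2")) -> List[Tuple[Arm, Arm]]:
    """Enumerate distinct two-armed antibodies formed by random chain assortment."""
    arms = list(product(heavy, light))  # 2 heavy x 2 light = 4 possible arms
    # Unordered pairs of arms, allowing both arms to be the same.
    return list(combinations_with_replacement(arms, 2))


species = quadroma_species()
# The desired product pairs each heavy chain with its cognate light chain.
desired = (("H1", "L1"), ("H2", "L2"))
```

Four possible arms give 4 symmetric molecules plus 6 mixed ones, for 10 in total, and only the single combination held in `desired` carries both correctly paired binding sites. This is why the yield of the wanted bispecific species is low and purification is needed.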

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites)
can be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin
heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred
to have the first heavy-chain constant region (CH1) containing the site necessary for light-chain binding present
in at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the
immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable
host organism. For further details of generating bispecific antibodies see, for example, Suresh et al., Methods
in Enzymology, 121:210 (1986).

According to another approach described in WO 96/27011, the interface between a pair of antibody
molecules can be engineered to maximize the percentage of heterodimers which are recovered from
recombinant cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody
constant domain. In this method, one or more small amino acid side chains from the interface of the first
antibody molecule are replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities"
of identical or similar size to the large side chain(s) are created on the interface of the second antibody
molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or threonine). This provides
a mechanism for increasing the yield of the heterodimer over other unwanted end-products such as
homodimers.

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab')2
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been
described in the literature. For example, bispecific antibodies can be prepared using chemical
linkage. Brennan et al., Science 229:81 (1985) describe a procedure wherein intact antibodies are
proteolytically cleaved to generate F(ab')2 fragments. These fragments are reduced in the presence of the
dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide
formation. The Fab' fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of
the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The
bispecific antibodies produced can be used as agents for the selective immobilization of enzymes.

Fab' fragments may be directly recovered from E. coli and chemically coupled to form bispecific
antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized
bispecific antibody F(ab')2 molecule. Each Fab' fragment was separately secreted from E. coli and subjected
to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic
activity of human cytotoxic lymphocytes against human breast tumor targets.

Various techniques for making and isolating bispecific antibody fragments directly from recombinant
cell culture have also been described. For example, bispecific antibodies have been produced using leucine
zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The leucine zipper peptides from the Fos
and Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody
homodimers were reduced at the hinge region to form monomers and then re-oxidized to form the antibody
heterodimers. This method can also be utilized for the production of antibody homodimers. The "diabody"
technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an
alternative mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain
variable domain (VH) connected to a light-chain variable domain (VL) by a linker which is too short to allow
pairing between the two domains on the same chain. Accordingly, the VH and VL domains of one fragment
are forced to pair with the complementary VL and VH domains of another fragment, thereby forming two
antigen-binding sites. Another strategy for making bispecific antibody fragments by the use of single-chain
Fv (sFv) dimers has also been reported. See, Gruber et al., J. Immunol. 152:5368 (1994).

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be
prepared. Tutt et al., J. Immunol. 147:60 (1991).

Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypeptide
herein. Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a triggering
molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors
for IgG (FcγR), such as FcγRI (CD64), FcγRII (CD32) and FcγRIII (CD16) so as to focus cellular defense
mechanisms to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to
localize cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies possess a
PRO-binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE,
DPTA, DOTA, or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds
tissue factor (TF).

5. Heteroconjugate Antibodies

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been
proposed to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of 
HIV infection [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be 
prepared in vitro using known methods in synthetic protein chemistry, including those involving crosslinking 
agents. For example, immunotoxins may be constructed using a disulfide exchange reaction or by forming 
a thioether bond. Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 

6. Effector Function Engineering

It may be desirable to modify the antibody of the invention with respect to effector function, so as to
enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be
introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The
homodimeric antibody thus generated may have improved internalization capability and/or increased
complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J.
Exp. Med., 176:1191-1195 (1992) and Shopes, J. Immunol., 148:2918-2922 (1992). Homodimeric antibodies
with enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as described in
Wolff et al., Cancer Research, 53:2560-2565 (1993). Alternatively, an antibody can be engineered that has
dual Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et
al., Anti-Cancer Drug Design, 3:219-230 (1989).

7. Immunoconjugates

The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic
agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, fungal, plant,
or animal origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate).

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described
above. Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain,
nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A
chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolacca
americana proteins (PAPI, PAPII, and PAP-S), Momordica charantia inhibitor, curcin, crotin, Saponaria
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety
of radionuclides are available for the production of radioconjugated antibodies. Examples include ²¹²Bi, ¹³¹I,
¹³¹In, ⁹⁰Y, and ¹⁸⁶Re. Conjugates of the antibody and cytotoxic agent are made using a variety of
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP),
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCl), active esters
(such as disuccinimidyl suberate), aldehydes (such as glutaraldehyde), bis-azido compounds (such as
bis(p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine),
diisocyanates (such as toluene 2,6-diisocyanate), and bis-active fluorine compounds (such as
1,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in Vitetta et al.,
Science, 238:1098 (1987). Carbon-14-labeled 1-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid
(MX-DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See
WO94/11026.

In another embodiment, the antibody may be conjugated to a "receptor" (such as streptavidin) for
utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient,
followed by removal of unbound conjugate from the circulation using a clearing agent and then administration
of a "ligand" (e.g., avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleotide).

8. Immunoliposomes

The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing
the antibody are prepared by methods known in the art, such as described in Epstein et al., Proc. Natl. Acad.
Sci. USA, 82:3688 (1985); Hwang et al., Proc. Natl. Acad. Sci. USA, 77:4030 (1980); and U.S. Pat. Nos.
4,485,045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No.
5,013,556.

Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid
composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized phosphatidylethanolamine
(PEG-PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired
diameter. Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as
described in Martin et al., J. Biol. Chem., 257:286-288 (1982) via a disulfide-interchange reaction. A
chemotherapeutic agent (such as Doxorubicin) is optionally contained within the liposome. See Gabizon et al.,
J. National Cancer Inst., 81(19):1484 (1989).

9. Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a PRO polypeptide identified herein, as well as other molecules
identified by the screening assays disclosed hereinbefore, can be administered for the treatment of various
disorders in the form of pharmaceutical compositions.

If the PRO polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing
antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an
antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that
specifically binds to the binding domain of the target protein is preferred. For example, based upon the
variable-region sequences of an antibody, peptide molecules can be designed that retain the ability to bind the
target protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA
technology. See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA, 90: 7889-7893 (1993). The
formulation herein may also contain more than one active compound as necessary for the particular indication
being treated, preferably those with complementary activities that do not adversely affect each other.
Alternatively, or in addition, the composition may comprise an agent that enhances its function, such as, for
example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules
are suitably present in combination in amounts that are effective for the purpose intended.
The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation
techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and
poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example,
liposomes, albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsions.
Such techniques are disclosed in Remington's Pharmaceutical Sciences, supra.

The formulations to be used for in vivo administration must be sterile. This is readily accomplished
by filtration through sterile filtration membranes.

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations
include semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are
in the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include
polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides
(U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and γ ethyl-L-glutamate, non-degradable
ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT™ (injectable
microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and
poly-D-(-)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release
of molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When encapsulated
antibodies remain in the body for a long time, they may denature or aggregate as a result of exposure to
moisture at 37°C, resulting in a loss of biological activity and possible changes in immunogenicity. Rational
strategies can be devised for stabilization depending on the mechanism involved. For example, if the
aggregation mechanism is discovered to be intermolecular S-S bond formation through thio-disulfide
interchange, stabilization may be achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions,
controlling moisture content, using appropriate additives, and developing specific polymer matrix compositions.

G. Uses for anti-PRO Antibodies 

The anti-PRO antibodies of the invention have various utilities. For example, anti-PRO antibodies
may be used in diagnostic assays for PRO, e.g., detecting its expression in specific cells, tissues, or serum.
Various diagnostic assay techniques known in the art may be used, such as competitive binding assays, direct
or indirect sandwich assays and immunoprecipitation assays conducted in either heterogeneous or homogeneous
phases [Zola, Monoclonal Antibodies: A Manual of Techniques, CRC Press, Inc. (1987) pp. 147-158]. The
antibodies used in the diagnostic assays can be labeled with a detectable moiety. The detectable moiety should
be capable of producing, either directly or indirectly, a detectable signal. For example, the detectable moiety
may be a radioisotope, such as ³H, ¹⁴C, ³²P, ³⁵S, or ¹²⁵I, a fluorescent or chemiluminescent compound, such
as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase,
beta-galactosidase or horseradish peroxidase. Any method known in the art for conjugating the antibody to the
detectable moiety may be employed, including those methods described by Hunter et al., Nature, 144:945
(1962); David et al., Biochemistry, 13:1014 (1974); Pain et al., J. Immunol. Meth., 40:219 (1981); and
Nygren, J. Histochem. and Cytochem., 30:407 (1982).

Anti-PRO antibodies also are useful for the affinity purification of PRO from recombinant cell culture
or natural sources. In this process, the antibodies against PRO are immobilized on a suitable support, such
as a Sephadex resin or filter paper, using methods well known in the art. The immobilized antibody then is
contacted with a sample containing the PRO to be purified, and thereafter the support is washed with a suitable
solvent that will remove substantially all the material in the sample except the PRO, which is bound to the
immobilized antibody. Finally, the support is washed with another suitable solvent that will release the PRO
from the antibody.

The following examples are offered for illustrative purposes only, and are not intended to limit the 
scope of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by 
reference in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's
instructions unless otherwise indicated. The source of those cells identified in the following examples, and
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas,
VA.

EXAMPLE 1 : Extracellular Domain Homology Screening to Identify Novel Polypeptides and cDNA Encoding Therefor

The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from about
950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. The
EST databases included public databases (e.g., Dayhoff, GenBank), and proprietary databases (e.g.,
LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program
BLAST or BLAST-2 (Altschul et al., Methods in Enzymology 266:460-480 (1996)) as a comparison of the
ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons with a BLAST
score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled
into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle,
WA).

Using this extracellular domain homology screen, consensus DNA sequences were assembled relative
to the other identified EST sequences using phrap. In addition, the consensus DNA sequences obtained were
often (but not always) extended using repeated cycles of BLAST or BLAST-2 and phrap to extend the
consensus sequence as far as possible using the sources of EST sequences discussed above.
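The comparison against "a 6 frame translation of the EST sequences" in Example 1 can be illustrated with a minimal Python sketch. It assumes the standard genetic code and a plain DNA string as input; the function names are hypothetical and are not taken from BLAST or from the specification.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, with codons enumerated in T, C, A, G order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(dna):
    """Translate complete codons only; codons with non-ACGT bases become 'X'."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def six_frame_translations(dna):
    """Return the three forward and three reverse-complement reading frames."""
    dna = dna.upper()
    reverse = dna.translate(COMPLEMENT)[::-1]
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(reverse[offset:])
    return frames


if __name__ == "__main__":
    for name, protein in sorted(six_frame_translations("ATGGCCTAA").items()):
        print(name, protein)
```

Each of the six translated frames is what a protein query such as an ECD sequence would be aligned against; the score cutoff of 70 (or 90) described above would then be applied to the resulting alignments.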
Based upon the consensus sequences obtained as described above, oligonucleotides were then
synthesized and used to identify by PCR a cDNA library that contained the sequence of interest and for use
as probes to isolate a clone of the full-length coding sequence for a PRO polypeptide. Forward and reverse
PCR primers generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about
100-1000 bp in length. The probe sequences are typically 40-55 bp in length. In some cases, additional
oligonucleotides are synthesized when the consensus sequence is greater than about 1-1.5 kbp. In order to
screen several libraries for a full-length clone, DNA from the libraries was screened by PCR amplification,
as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. A positive library
was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and one of the
primer pairs.
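The size ranges quoted above for primers, probes and PCR products can be captured in a small checker. This is only an illustrative sketch: the ranges come from the text, but the function name and the idea of reporting departures as a list are assumptions made for the example, and the text itself presents these ranges as typical rather than mandatory.

```python
def check_oligo_design(forward, reverse, probe, product_bp):
    """Return a list of departures from the typical ranges stated in the text."""
    issues = []
    for name, primer in (("forward primer", forward), ("reverse primer", reverse)):
        if not 20 <= len(primer) <= 30:
            issues.append(f"{name} is {len(primer)} nt (typical: 20-30)")
    if not 40 <= len(probe) <= 55:
        issues.append(f"probe is {len(probe)} nt (typical: 40-55)")
    if not 100 <= product_bp <= 1000:
        issues.append(f"PCR product is {product_bp} bp (typical: about 100-1000)")
    return issues


if __name__ == "__main__":
    # Placeholder sequences; only their lengths matter for this check.
    print(check_oligo_design("A" * 24, "C" * 18, "G" * 45, 500))
```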

The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with
oligo dT containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such
as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the SfiI site; see, Holmes et al.,
Science, 253:1278-1280 (1991)) in the unique XhoI and NotI sites.

EXAMPLE 2 : Isolation of cDNA clones by Amylase Screening 

1. Preparation of oligo dT primed cDNA library 

mRNA was isolated from a human tissue of interest using reagents and protocols from Invitrogen, San
Diego, CA (Fast Track 2). This RNA was used to generate an oligo dT primed cDNA library in the vector
pRK5D using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid
System). In this procedure, the double stranded cDNA was sized to greater than 1000 bp and the SalI/NotI
linkered cDNA was cloned into XhoI/NotI cleaved vector. pRK5D is a cloning vector that has an sp6
transcription initiation site followed by an SfiI restriction enzyme site preceding the XhoI/NotI cDNA cloning
sites.

2. Preparation of random primed cDNA library 

A secondary cDNA library was generated in order to preferentially represent the 5' ends of the
primary cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA
was used to generate a random primed cDNA library in the vector pSST-AMY.0 using reagents and protocols
from Life Technologies (Super Script Plasmid System, referenced above). In this procedure the double
stranded cDNA was sized to 500-1000 bp, linkered with blunt to NotI adaptors, cleaved with SfiI, and cloned
into SfiI/NotI cleaved vector. pSST-AMY.0 is a cloning vector that has a yeast alcohol dehydrogenase
promoter preceding the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the
secretion signal) followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, cDNAs
cloned into this vector that are fused in frame with amylase sequence will lead to the secretion of amylase from
appropriately transfected yeast colonies.



3. Transformation and Detection

DNA from the library described in paragraph 2 above was chilled on ice, and electrocompetent DH10B
bacteria (Life Technologies, 20 ml) were added. The bacteria and vector mixture was then
electroporated as recommended by the manufacturer. Subsequently, SOC media (Life Technologies, 1 ml)
was added and the mixture was incubated at 37°C for 30 minutes. The transformants were then plated onto
20 standard 150 mm LB plates containing ampicillin and incubated for 16 hours (37°C). Positive colonies
were scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g.,
CsCl-gradient. The purified DNA was then carried on to the yeast protocols below.

The yeast methods were divided into three categories: (1) Transformation of yeast with the
plasmid/cDNA combined vector; (2) Detection and isolation of yeast clones secreting amylase; and (3) PCR
amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and
further analysis.

The yeast strain used was HD56-5A (ATCC-90785). This strain has the following genotype: MAT
alpha, ura3-52, leu2-3, leu2-112, his3-11, his3-15, MAL+, SUC+, GAL+. Preferably, yeast mutants can be
employed that have deficient post-translational pathways. Such mutants may have translocation deficient alleles
in sec71, sec72, sec62, with truncated sec71 being most preferred. Alternatively, antagonists (including
antisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins
implicated in this post translation pathway (e.g., SEC61p, SEC72p, SEC62p, SEC63p, TDJ1p or SSA1p-4p)
or the complex formation of these proteins may also be preferably employed in combination with the
amylase-expressing yeast.

Transformation was performed based on the protocol outlined by Gietz et al., Nucl. Acid. Res.,
20:1425 (1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml)
and grown overnight at 30°C. The YEPD broth was prepared as described in Kaiser et al., Methods in Yeast
Genetics, Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 207 (1994). The overnight culture was then
diluted to about 2 x 10⁶ cells/ml (approx. OD600 = 0.1) into fresh YEPD broth (500 ml) and regrown to 1 x 10⁷
cells/ml (approx. OD600 = 0.4-0.5).

The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in
a Sorval GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile
water, and centrifuged again in 50 ml falcon tubes at 3,500 rpm in a Beckman GS-6KR centrifuge. The
supernatant was discarded and the cells were subsequently washed with LiAc/TE (10 ml, 10 mM Tris-HCl,
1 mM EDTA pH 7.5, 100 mM Li2OOCCH3), and resuspended into LiAc/TE (2.5 ml).

Transformation took place by mixing the prepared cells (100 µl) with freshly denatured single stranded
salmon testes DNA (Lofstrand Labs, Gaithersburg, MD) and transforming DNA (1 µg, vol. < 10 µl) in
microfuge tubes. The mixture was mixed briefly by vortexing, then 40% PEG/TE (600 µl, 40% polyethylene
glycol-4000, 10 mM Tris-HCl, 1 mM EDTA, 100 mM Li2OOCCH3, pH 7.5) was added. This mixture was
gently mixed and incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42°C
for 15 minutes, and the reaction vessel centrifuged in a microfuge at 12,000 rpm for 5-10 seconds, decanted
and resuspended into TE (500 µl, 10 mM Tris-HCl, 1 mM EDTA pH 7.5) followed by recentrifugation. The
cells were then diluted into TE (1 ml) and aliquots (200 µl) were spread onto the selective media previously
prepared in 150 mm growth plates (VWR).

Alternatively, instead of multiple small reactions, the transformation was performed using a single, 
large scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared
as described in Kaiser et al., Methods in Yeast Genetics, Cold Spring Harbor Press, Cold Spring Harbor, NY,
p. 208-210 (1994). Transformants were grown at 30°C for 2-3 days.

The detection of colonies secreting amylase was performed by including red starch in the selective
growth media. Starch was coupled to the red dye (Reactive Red-120, Sigma) as per the procedure described
by Biely et al., Anal. Biochem., 172:176-179 (1988). The coupled starch was incorporated into the SCD-Ura
agar plates at a final concentration of 0.15% (w/v), and was buffered with potassium phosphate to a pH of 7.0
(50-100 mM final concentration).

The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in
order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase
secretion were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies
were determined by their ability to break down starch resulting in a clear halo around the positive colony
visualized directly.

4. Isolation of DNA by PCR Amplification 

When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile
water (30 µl) in a 96 well plate. At this time, the positive colonies were either frozen and stored for
subsequent analysis or immediately amplified. An aliquot of cells (5 µl) was used as a template for the PCR
reaction in a 25 µl volume containing: 0.5 µl Klentaq (Clontech, Palo Alto, CA); 4.0 µl 10 mM dNTP's
(Perkin Elmer-Cetus); 2.5 µl Klentaq buffer (Clontech); 0.25 µl forward oligo 1; 0.25 µl reverse oligo 2; 12.5
µl distilled water. The sequence of the forward oligonucleotide 1 was:

5'-TGTAAAACGACGGCCAGT TAAATAGACCTGCAATTATTAATCT -3' (SEQ ID NO:3)

The sequence of reverse oligonucleotide 2 was:

5'-CAGGAAACAGCTATGAC CACCTGCACACCTGCAAATCCATT -3' (SEQ ID NO:4)
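The component volumes listed for the colony PCR can be checked against the stated 25 µl reaction volume, and scaled for a full 96 well plate, with a short Python sketch. The volumes are those given in the text; the `master_mix` helper and its pipetting overage parameter are assumptions added for illustration only.

```python
# Per-reaction volumes in microliters, as listed in the text.
REACTION_UL = {
    "cell suspension (template)": 5.0,
    "Klentaq polymerase": 0.5,
    "10 mM dNTPs": 4.0,
    "Klentaq buffer": 2.5,
    "forward oligo 1": 0.25,
    "reverse oligo 2": 0.25,
    "distilled water": 12.5,
}


def total_volume_ul():
    """Sum of all components, expected to equal the 25 microliter reaction."""
    return sum(REACTION_UL.values())


def master_mix(n_reactions, overage=0.10):
    """Scale every non-template component for n reactions.

    `overage` is a hypothetical allowance for pipetting loss, not a value
    taken from the specification.
    """
    factor = n_reactions * (1 + overage)
    return {
        name: round(volume * factor, 2)
        for name, volume in REACTION_UL.items()
        if "template" not in name
    }


if __name__ == "__main__":
    print("per-reaction total (µl):", total_volume_ul())
    for name, volume in master_mix(96).items():
        print(f"{name}: {volume} µl")
```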

PCR was then performed as follows:

a.                   Denature    92°C,    5 minutes
b.   3 cycles of:    Denature    92°C,    30 seconds
                     Anneal      59°C,    30 seconds
                     Extend      72°C,    60 seconds
c.   3 cycles of:    Denature    92°C,    30 seconds
                     Anneal      57°C,    30 seconds
                     Extend      72°C,    60 seconds
d.   25 cycles of:   Denature    92°C,    30 seconds
                     Anneal      55°C,    30 seconds
                     Extend      72°C,    60 seconds
e.                   Hold        4°C
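The cycling program above lowers the annealing temperature in steps (59°C, 57°C, 55°C) before the main block of 25 cycles. A minimal Python sketch of the program as a data structure is given below; the temperatures and times come from the table, while the representation and helper names are assumptions for illustration. The time estimate ignores ramping between temperatures and the final hold.

```python
# (number of cycles, [(step name, temperature in °C, duration in seconds), ...])
PROGRAM = [
    (1, [("denature", 92, 300)]),
    (3, [("denature", 92, 30), ("anneal", 59, 30), ("extend", 72, 60)]),
    (3, [("denature", 92, 30), ("anneal", 57, 30), ("extend", 72, 60)]),
    (25, [("denature", 92, 30), ("anneal", 55, 30), ("extend", 72, 60)]),
]


def amplification_cycles(program):
    """Count cycles that contain a full denature/anneal/extend sequence."""
    return sum(cycles for cycles, steps in program if len(steps) > 1)


def programmed_seconds(program):
    """Total programmed hold time, excluding ramp times and the 4°C hold."""
    return sum(cycles * sum(sec for _, _, sec in steps) for cycles, steps in program)


def annealing_temperatures(program):
    """Annealing temperature of each cycling block, in program order."""
    return [temp for _, steps in program for name, temp, _ in steps if name == "anneal"]


if __name__ == "__main__":
    print("amplification cycles:", amplification_cycles(PROGRAM))
    print("programmed minutes:", programmed_seconds(PROGRAM) / 60)
    print("annealing steps (°C):", annealing_temperatures(PROGRAM))
```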



The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase
region, respectively, and amplified a 307 bp region from vector pSST-AMY.0 when no insert was present.
Typically, the first 18 nucleotides of the 5' end of these oligonucleotides contained annealing sites for the
sequencing primers. Thus, the total product of the PCR reaction from an empty vector was 343 bp. However,
signal sequence-fused cDNA resulted in considerably longer nucleotide sequences.

Following the PCR, an aliquot of the reaction (5 µl) was examined by agarose gel electrophoresis in
a 1% agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al., supra.
Clones resulting in a single strong PCR product larger than 400 bp were further analyzed by DNA sequencing
after purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc., Chatsworth, CA).
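The product-size reasoning in this section (a 307 bp vector region plus an 18-nucleotide sequencing-primer site on each oligonucleotide gives 343 bp for an empty vector, and only single products above 400 bp are sequenced) can be written out as a short Python sketch. The numbers are from the text; the function names and the representation of a gel lane as a list of band sizes are assumptions for illustration.

```python
EMPTY_VECTOR_CORE_BP = 307  # region amplified from the vector when no insert is present
SEQUENCING_TAIL_NT = 18     # sequencing-primer site at the 5' end of each oligonucleotide
SIZE_CUTOFF_BP = 400        # products must be larger than this to be sequenced


def empty_vector_product_bp():
    """Expected PCR product from a vector that carries no cDNA insert."""
    return EMPTY_VECTOR_CORE_BP + 2 * SEQUENCING_TAIL_NT


def approx_insert_bp(product_bp):
    """Rough insert size implied by an observed product size."""
    return max(0, product_bp - empty_vector_product_bp())


def worth_sequencing(band_sizes_bp):
    """True for a lane with exactly one band larger than the cutoff.

    Band intensity ("strong") is not modeled here; a lane is simply a
    list of estimated band sizes.
    """
    return len(band_sizes_bp) == 1 and band_sizes_bp[0] > SIZE_CUTOFF_BP


if __name__ == "__main__":
    print(empty_vector_product_bp())
    print(worth_sequencing([850]), approx_insert_bp(850))
```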

EXAMPLE 3 : Isolation of cDNA Clones Using Signal Algorithm Analysis 

Various polypeptide-encoding nucleic acid sequences were identified by applying a proprietary signal
sequence finding algorithm developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as
clustered and assembled EST fragments from public (e.g., GenBank) and/or private (LIFESEQ®, Incyte
Pharmaceuticals, Inc., Palo Alto, CA) databases. The signal sequence algorithm computes a secretion signal
score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine
codon(s) (ATG) at the 5'-end of the sequence or sequence fragment under consideration. The nucleotides
following the first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the
first ATG has the required amino acids, the second is not examined. If neither meets the requirement, the
candidate sequence is not scored. In order to determine whether the EST sequence contains an authentic signal
sequence, the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a
set of seven sensors (evaluation parameters) known to be associated with secretion signals. Use of this
algorithm resulted in the identification of numerous polypeptide-encoding nucleic acid sequences.
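The eligibility rule described above (examine the first ATG, fall back to the second, and require at least 35 unambiguous, stop-free codons after it) can be sketched in Python. This covers only that pre-filter: the seven scoring sensors are proprietary and not described here. The sketch assumes that ATG positions are found by a simple left-to-right scan and that "unambiguous" means every base is A, C, G or T; both are interpretations, and all function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
MIN_CODONS = 35


def first_atg_positions(seq, limit=2):
    """Positions of the first `limit` ATG triplets, scanning left to right."""
    positions = []
    i = seq.find("ATG")
    while i != -1 and len(positions) < limit:
        positions.append(i)
        i = seq.find("ATG", i + 1)
    return positions


def has_required_codons(seq, atg_pos, min_codons=MIN_CODONS):
    """True if min_codons complete, unambiguous, non-stop codons follow the ATG."""
    start = atg_pos + 3
    for k in range(min_codons):
        codon = seq[start + 3 * k:start + 3 * k + 3]
        if len(codon) < 3 or set(codon) - set("ACGT") or codon in STOP_CODONS:
            return False
    return True


def atg_to_score(seq):
    """Return the position of the ATG that would be scored, or None."""
    seq = seq.upper()
    for pos in first_atg_positions(seq):
        if has_required_codons(seq, pos):
            return pos
    return None


if __name__ == "__main__":
    candidate = "ATGTAAATG" + "GCC" * 35  # first ATG hits a stop codon at once
    print(atg_to_score(candidate))
```

In this sketch a return value of None corresponds to a candidate sequence that "is not scored".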


EXAMPLE 4 : Isolation of cDNA clones Encoding Human PRO281

In order to obtain a cDNA clone encoding PRO281, methods described in Klein et al., Proc. Natl.
Acad. Sci. USA 93:7108-7113 (1996) were employed with the following modifications. Yeast transformation
was performed with limiting amounts of transforming DNA in order to reduce the number of multiple
transformed yeast cells. Instead of plasmid isolation from the yeast followed by transformation of E. coli as
described in Klein et al., supra, PCR analysis was performed on single yeast colonies. PCR primers employed
were bipartite in order to amplify the insert and a small portion of the invertase gene (allowing to determine
that the insert was in frame with invertase) and to add on universal sequencing primer sites.

An invertase library was transformed into yeast and positives were selected on sucrose plates. Positive
clones were re-tested and PCR products were sequenced. The sequence of one clone, PRO281, was
determined to contain a signal peptide coding sequence. Oligonucleotide primers and probes were designed
using the nucleotide sequence of PRO281. A full length plasmid library of cDNAs from human umbilical vein
In light of an observed sequence homology between the DNA55728 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 2860366, the Incyte EST clone 2860366 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 175 and is herein designated as DNA58855-1422.

Clone DNA58855-1422 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 31-33 and ending at the stop codon at nucleotide positions 718-720 (Figure 175).
The predicted polypeptide precursor is 229 amino acids long (Figure 176). The full-length PRO994 protein
shown in Figure 176 has an estimated molecular weight of about 25,109 daltons and a pI of about 6.83.
Analysis of the full-length PRO994 sequence shown in Figure 176 (SEQ ID NO:258) evidences the presence
of the following: transmembrane domains from about amino acid 10 to about amino acid 31, from about amino
acid 50 to about amino acid 72, from about amino acid 87 to about amino acid 110 and from about amino acid
191 to about amino acid 213, potential N-glycosylation sites from about amino acid 80 to about amino acid 83,
from about amino acid 132 to about amino acid 135, from about amino acid 148 to about amino acid 151 and
from about amino acid 163 to about amino acid 166 and an amino acid block having homology to
TNFR/NGFR cysteine-rich region proteins from about amino acid 4 to about amino acid 11. Clone
DNA58855-1422 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203018.
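The stated open reading frame coordinates and the predicted precursor length are related by simple arithmetic, which the following Python sketch makes explicit. The coordinates and molecular weight are those given in the text; the function names are hypothetical, and the mean residue mass is only a rough plausibility check.

```python
def predicted_length_aa(start_codon_first_nt, stop_codon_last_nt):
    """Amino acid count for an ORF given 1-based, inclusive nucleotide coordinates.

    The span runs from the first base of the initiation codon to the last
    base of the stop codon; the stop codon itself encodes no residue.
    """
    span = stop_codon_last_nt - start_codon_first_nt + 1
    if span % 3:
        raise ValueError("ORF span is not a whole number of codons")
    return span // 3 - 1


def mean_residue_mass(molecular_weight_da, length_aa):
    """Average mass per residue, useful as a quick sanity check."""
    return molecular_weight_da / length_aa


if __name__ == "__main__":
    length = predicted_length_aa(31, 720)
    print(length, round(mean_residue_mass(25109, length), 1))
```

For the coordinates above (31 through 720) the span is 690 nucleotides, or 230 codons including the stop codon, which agrees with the stated 229 amino acids.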

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 176 (SEQ ID NO:258), evidenced significant
homology between the PRO994 amino acid sequence and the following Dayhoff sequences: AF027204_1,
TAL6_HUMAN, ILT4_HUMAN, JC6205, MMU57570_1, S40363, ETU56093_1, S42858, P_R66849 and
P_R74751.

EXAMPLE 77 : Isolation of cDNA clones Encoding Human PRO812

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated 170079. This EST cluster sequence was then compared
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank)
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altschul
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70
(or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle,
Washington). The consensus sequence obtained therefrom is herein designated as DNA55721.

In light of an observed sequence homology between the DNA55721 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 388964, the Incyte EST clone 388964 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 177 and is herein designated as DNA59205-1421.

Clone DNA59205-1421 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 55-57 and ending at the stop codon at nucleotide positions 304-306 (Figure 177).
The predicted polypeptide precursor is 83 amino acids long (Figure 178). The full-length PRO812 protein
shown in Figure 178 has an estimated molecular weight of about 9,201 daltons and a pI of about 9.3. Analysis
of the full-length PRO812 sequence shown in Figure 178 (SEQ ID NO:260) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 15, a cAMP- and cGMP-dependent
protein kinase phosphorylation site from about amino acid 73 to about amino acid 76 and protein kinase C
phosphorylation sites from about amino acid 70 to about amino acid 72 and from about amino acid 76 to about
amino acid 78. Clone DNA59205-1421 has been deposited with ATCC on June 23, 1998 and is assigned
ATCC deposit no. 203009.
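The relationship between the stated open reading frame (nucleotides 55 through 306 of Figure 177) and the 83-residue sequence of Figure 178 can be checked by translating that region. The Python sketch below assumes the standard genetic code; the DNA string is copied from the first five lines of Figure 177, which cover the whole reading frame, and the variable and function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, with codons enumerated in T, C, A, G order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# First 310 nucleotides of Figure 177 (DNA59205-1421).
FIGURE_177_START = (
    "GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCT"
    "GTCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT"
    "GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTC"
    "CAAGTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG"
    "CACCGATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAATAGTGAA"
)


def translate_orf(dna, first_nt, last_nt):
    """Translate a 1-based, inclusive ORF; the trailing stop codon is dropped."""
    orf = dna[first_nt - 1:last_nt]
    if len(orf) % 3:
        raise ValueError("ORF span is not a whole number of codons")
    protein = "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf), 3))
    if not protein.endswith("*"):
        raise ValueError("ORF does not end at a stop codon")
    return protein[:-1]


if __name__ == "__main__":
    protein = translate_orf(FIGURE_177_START, 55, 306)
    print(len(protein), protein)
```

Run against the coordinates in the text, the translation begins with the initiator methionine at positions 55-57 and ends just before the TAG stop codon at positions 304-306.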

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 178 (SEQ ID NO:260), evidenced significant
homology between the PRO812 amino acid sequence and the following Dayhoff sequences: P_W35802,
P_W35803, PSC1_RAT, S6823U GEN139n. PSC2_RAT, CC10_HUMAN, UTER_RABIT, AF008595_1
and A56413.

EXAMPLE 78 : Isolation of cDNA clones Encoding Human PRO1069

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
Incyte EST sequence designated herein as 100727. This sequence was then compared to a proprietary EST
DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods
in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases
90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA
sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus
sequence obtained therefrom is herein designated DNA56001.

In light of an observed sequence homology between the DNA56001 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 3533881, the Incyte EST clone 3533881 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 179 and is the full-length DNA sequence for PRO1069.
Clone DNA59211-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no.
209960.

The entire nucleotide sequence of DNA59211-1450 is shown in Figure 179 (SEQ ID NO:261). Clone
DNA59211-1450 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 197-199 and ending at the stop codon at nucleotide positions 464-466. The predicted polypeptide
precursor is 89 amino acids long (Figure 180). The full-length PRO1069 protein shown in Figure 180 has an
estimated molecular weight of about 9,433 daltons and a pI of about 8.21. Analysis of the full-length
PRO1069 sequence shown in Figure 180 (SEQ ID NO:262) evidences the presence of the following features:
a signal peptide sequence at amino acid 1 to about 16; a transmembrane domain at about amino acids 36 to
about 59; potential N-myristoylation sites at about amino acids 41-46, 45-50, and 84-89; and homology with
extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7 at about amino acids 54 to about 66.
FIGURE 177 

GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCT 
GTCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 
GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTC 
CAAGTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG 
CACCGATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAATAGTGAA 
AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTCAACGACACC 
CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 

FIGURE 178 

MRLSVCLLMVSLALCCYQAHALVCPAVASEITVFLFLSDAAVNLQVAKLNPPPEALAAKLEV 
KHCTDQISFKKRLSLKKSWWK 
FIGURE 131 

MGVEIAFASVILTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 
FIGURE 132 

GGGGAATCTGCAGTAGGTCTGCCGGCGATGGAGTGGTGGGCTAGCTCGCCGCTTCGGCTCTG
GCTGCTGTTGTTCCTCCTGCCCTCAGCGCAGGGCCGCCAGAAGGAGTCAGGTTCAAAATGGA 
AAGTATTTATTGACCAAATTAACAGGTCTTTGGAGAATTACGAACCATGTTCAAGTCAAAAC 
TGCAGCTGCTACCATGGTGTCATAGAAGAGGATCTAACTCCTTTCCGAGGAGGCATCTCCAG 
GAAGATGATGGCAGAGGTAGTCAGACGGAAGCTAGGGACCCACTATCAGATCACTAAGAACA 
GACTGTACCGGGAAAATGACTGCATGTTCCCCTCAAGGTGTAGTGGTGTTGAGCACTTTATT 
TTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGATCAATGTACGAGATTATCCTCA 
GGTTCCTAAATGGATGGAGCCTGCCATCCCAGTCTTCTCCTTCAGTAAGACATCAGAGTACC 
ATGATATCATGTATCCTGCTTGGACATTTTGGGAAGGGGGACCTGCTGTTTGGCCAATTTAT 
CCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATCTGGTAAGGTCAGCAGCACAGTG 
GCCATGGAAAAAGAAAAACTCTACAGCATATTTCCGAGGATCAAGGACAAGTCCAGAACGAG 
ATCCTCTCATTCTTCTGTCTCGGAAAAACCCAAAACTTGTTGATGCAGAATACACCAAAAAC 
CAGGCCTGGAAATCTATGAAAGATACCTTAGGTAAGCCAGCTGCTAAGGATGTCCATCTTGT
GGATCACTGCAAATACAAGTATCTGTTTAATTTTCGAGGCGTAGCTGCAAGTTTCCGGTTTA 
AACACCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTGGTGATGAGTGGCTAGAATTCTTC 
TATCCACAGCTGAAGCCATGGGTTCACTATATCCCAGTCAAAACAGATCTCTCCAATGTCCA 
AGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGAAAGGGGAA 
GCCAGTTTATTAGGAACCATTTGCAGATGGATGACATCACCTGTTACTGGGAGAACCTCTTG 
AGTGAATACTCTAAATTCCTGTCTTATAATGTAACGAGAAGGAAAGGTTATGATCAAATTAT 
TCCCAAAATGTTGAAAACTGAACTATAGTAGTCATCATAGGACCATAGTCCTCTTTGTGGCA 
ACAGATCTCAGATATCCTACGGTGAGAAGCTTACCATAAGCTTGGCTCCTATACCTTGAATA 
TCTGCTATCAAGCCAAATACCTGGTTTTCCTTATCATGCTGCACCCAGAGCAACTCTTGAGA 
AAGATTTAAAATGTGTCTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGA 
CCAGAAATCGTGAGATGTGGATTTTGAACCCAACTCTACCTTTCATTTTCTTAAGACCAATC 
ACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTGTGTCCA 
TGTGATGATGCCCTTTGTCCCATTATTTGGAGCAGAAAATTCGTCATTTGGAAGTAGTACAA 
CTCATTGCTGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACTTTATTTTAATGTAGG 
AAACCCTATGGGGTTTATGAAAAATACTTGGGGATCATTCTCTGAATGGTCTAAGGAAGCGG 
TAGCCATGCCATGCAATGATGTAGGAGTTCTCTTTTGTAAAACCATAAACTCTGTTACTCAG 
GAGGTTTCTATAATGCCACATAGAAAGAGGCCAATTGCATGAGTAATTATTGCAATTGGATT 
TCAGGTTCCCTTTTTGTGCCTTCATGCCCTACTTCTTAATGCCTCTCTAAAGCCAAA 
133/330
FIGURE 133 

MEWWASSPLRLWLLLFLLPSAQGRQKESGSKWKVFIDQINRSLENYEPCSSQNCSCYHGVIE 
EDLTPFRGGISRKMMAEVVRRKLGTHYQITKNRLYRENDCMFPSRCSGVEHFILEVIGRLPD
MEMVINVRDYPQVPKWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDL 
FREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDT 
LGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVH 
YIPVKTDLSNVQELLQFVKANDDVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY
NVTRRKGYDQIIPKMLKTEL
134/330
FIGURE 134 

CACCCCTCCATTTCTCGCCATGGCCCCTGCACTGCTCCTGATCCCTGCTGCCCTCGCCTCTT
TCATCCTGGCCTTTGGCACCGGAGTGGAGTTCGTGCGCTTTACCTCCCTTCGGCCACTTCTT 
GGAGGGATCCCGGAGTCTGGTGGTCCGGATGCCCGCCAGGGATGGCTGGCTGCCCTGCAGGA 
CCGCAGCATCCTTGCCCCCCTGGCATGGGATCTGGGGCTCCTGCTTCTATTTGTTGGGCAGC 
ACAGCCTCATGGCAGCTGAAAGAGTGAAGGCATGGACATCCCGGTACTTTGGGGTCCTTCAG 
AGGTCACTGTATGTGGCCTGCACTGCCCTGGCCTTGCAGCTGGTGATGCGGTACTGGGAGCC 
CATACCCAAAGGCCCTGTGTTGTGGGAGGCTCGGGCTGAGCCATGGGCCACCTGGGTGCCGC 
TCCTCTGCTTTGTGCTCCATGTCATCTCCTGGCTCCTCATCTTTAGCATCCTTCTCGTCTTT 
GACTATGCTGAGCTCATGGGCCTCAAACAGGTATACTACCATGTGCTGGGGCTGGGCGAGCC 
TCTGGCCCTGAAGTCTCCCCGGGCTCTCAGACTCTTCTCCCACCTGCGCCACCCAGTGTGTG 
TGGAGCTGCTGACAGTGCTGTGGGTGGTGCCTACCCTGGGCACGGACCGTCTCCTCCTTGCT 
TTCCTCCTTACCCTCTACCTGGGCCTGGCTCACGGGCTTGATCAGCAAGACCTCCGCTACCT 
CCGGGCCCAGCTACAAAGAAAACTCCACCTGCTCTCTCGGCCCCAGGATGGGGAGGCAGAGT 
GAGGAGCTCACTCTGGTTACAAGCCCTGTTCTTCCTCTCCCACTGAATTCTAAATCCTTAAC 
ATCCAGGCCCTGGCTGCTTCATGCCAGAGGCCCAAATCCATGGACTGAAGGAGATGCCCCTT 
CTACTACTTGAGACTTTATTCTCTGGGTCCAGCTCCATACCCTAAATTCTGAGTTTCAGCCA 
CTGAACTCCAAGGTCCACTTCTCACCAGCAAGGAAGAGTGGGGTATGGAAGTCATCTGTCCC 
TTCACTGTTTAGAGCATGACACTCTCCCCCTCAACAGCCTCCTGAGAAGGAAAGGATCTGCC 
CTGACCACTCCCCTGGCACTGTTACTTGCCTCTGCGCCTCAGGGGTCCCCTTCTGCACCGCT 
GGCTTCCACTCCAAGAAGGTGGACCAGGGTCTGCAAGTTCAACGGTCATAGCTGTCCCTCCA 
GGCCCCAACCTTGCCTCACCACTCCCGGCCCTAGTCTCTGCACCTCCTTAGGCCCTGCCTCT 
GGGCTCAGACCCCAACCTAGTCAAGGGGATTCTCCTGCTCTTAACTCGATGACTTGGGGCTC 
CCTGCTCTCCCGAGGAAGATGCTCTGCAGGAAAATAAAAGTCAGCCTTTTTCTAAAAAAAA 
135/330
FIGURE 135 

MAPALLLIPAALASFILAFGTGVEFVRFTSLRPLLGGIPESGGPDARQGWLAALQDRSILAP 
LAWDLGLLLLFVGQHSLMAAERVKAWTSRYFGVLQRSLYVACTALALQLVMRYWEPIPKGPV
LWEARAEPWATWVPLLCFVLHVISWLLIFSILLVFDYAELMGLKQVYYHVLGLGEPLALKSP 
RALRLFSHLRHPVCVELLTVLWVVPTLGTDRLLLAFLLTLYLGLAHGLDQQDLRYLRAQLQR 
KLHLLSRPQDGEAE 

Signal sequence: 

amino acids 1-13 

Transmembrane domains:

amino acids 58-76, 99-113, 141-159, 203-222

N-myristoylation sites:

amino acids 37-43, 42-48, 229-235
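
The feature ranges above are 1-based and inclusive. The following is a minimal sketch of how such a range maps onto the Figure 135 sequence. The helper name `segment` is hypothetical, and the second line of the sequence is given in the reading implied by the coding sequence of Figure 134 (ERVKAW).

```python
# Figure 135 polypeptide, one string per printed line.
FIG135 = (
    "MAPALLLIPAALASFILAFGTGVEFVRFTSLRPLLGGIPESGGPDARQGWLAALQDRSILAP"
    "LAWDLGLLLLFVGQHSLMAAERVKAWTSRYFGVLQRSLYVACTALALQLVMRYWEPIPKGPV"
    "LWEARAEPWATWVPLLCFVLHVISWLLIFSILLVFDYAELMGLKQVYYHVLGLGEPLALKSP"
    "RALRLFSHLRHPVCVELLTVLWVVPTLGTDRLLLAFLLTLYLGLAHGLDQQDLRYLRAQLQR"
    "KLHLLSRPQDGEAE"
)


def segment(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive (hypothetical helper)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("range lies outside the sequence")
    return seq[start - 1:end]


signal_sequence = segment(FIG135, 1, 13)
transmembrane = [segment(FIG135, s, e)
                 for s, e in [(58, 76), (99, 113), (141, 159), (203, 222)]]
print(signal_sequence)
print(transmembrane)
```

With these inputs the signal sequence comes out as MAPALLLIPAALA, and each transmembrane segment has the length implied by its range.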
136/330
FIGURE 136 

CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGA 
AGAAATTGCCAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGT 
TTTAATTAAGCATGGAATACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATT 
CCACAGTTTTCTTAGCTCCCTGGACCCGGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTA 
TCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTAAAGAACCTTCGGCTCGCGTGCTTCTG 
AGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTAGGATGTCACTGAGATCC
CTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTACCTCAGCCT 
TCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTT 
ACAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTT 
CTGGTCATTCTGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTAC 
TTGGGGTGAAAAAAAGTCTTGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAG 
AGGCTGAAAAGGAAGACAAAATGTTGGCATTGTCCTTAGAGGATGAACACCTTCTTTATGGT 
GACATAATCCGACAAGATTTTTTAGACACATATAATAACCTGACCTTGAAAACCATTATGGC 
ATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATGAAGACAGACACTGATG 
TTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGAGAAGTTT 
TTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATAT 
TTCTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAA 
TGTCCAGAGATTTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTT 
GAAGATGTTTATGTCGGGATCTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGA 
CACAAATCTTTTCTTTCTATATAGAATCCATTTGGATGTCTGTCAACTGAGACGTGTGATTG 
CAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTTTGGCAGGTCATGCTAAGGAACACC 
ACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAGGATACCTTGTGGAAA 
GTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTTACACTG 
AACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTC 
AGGCCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAA 
GAAATTAATAGGACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGG 
TGTTACTGAGTTATAAGCTCACTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTG
AACAATGTAGTCACTTGAAGGTTTTGTGTATATCTTATGTGGATTACCAATTTAAAAATATA 
TGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATACTGAACAAAATTTTACCTGTTTT 
TGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATTATTATTTAAAATTA 
CTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAGTGAAT 
CATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCAC 
TCCATTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAAT 
ATTTTACTGTGGTAATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
137/330
FIGURE 137 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQD
FHFTLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEK 
EDKMLALSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFIN 
TGNLVKYLLNLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRD 
LVPRIYEMMGHVKPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHG 
FSSKEIITFWQVMLRNTTCHY
138/330

FIGURE 138 

CCTCTGTCCACTGCTTTCGTGAAGACAAGATGAAGTTCACAATTGTCTTTGCTGGACTTCTT 
GGAGTCTTTCTAGCTCCTGCCCTAGCTAACTATAATATCAACGTCAATGATGACAACAACAA 
TGCTGGAAGTGGGCAGCAGTCAGTGAGTGTCAACAATGAACACAATGTGGCCAATGTTGACA 
ATAACAACGGATGGGACTCCTGGAATTCCATCTGGGATTATGGAAATGGCTTTGCTGCAACC 
AGACTCTTTCAAAAGAAGACATGCATTGTGCACAAAATGAACAAGGAAGTCATGCCCTCCAT 
TCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCAGGGTAAGGGACCAGGAGGACCAC 
CTCCCAAGGGCCTGATGTACTCAGTCAACCCAAACAAAGTCGATGACCTGAGCAAGTTCGGA 
AAAAACATTGCAAACATGTGTCGTGGGATTCCAACATACATGGCTGAGGAGATGCAAGAGGC 
AAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTACTATGGATTGTGGACATTT 
CCTTCTGTGGAGACACGGTGGAGAACTAAACAATTTTTTAAAGCCACTATGGATTTAGTCAT 
CTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTTTTTACCATGTCATTCTGAAATT 
TTTCTCTACTAGTTATGTTTGATTTCTTTAAGTTTCAATAAAATCATTTAGCATTGAAAAAAA 
139/330
FIGURE 139 

MKFTIVFAGLLGVFLAPALANYNINVNDDNNNAGSGQQSVSVNNEHNVANVDNNNGWDSWNS
IWDYGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVN 
PNKVDDLSKFGKNIANMCRGIPTYMAEEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 

Signal Peptide:

amino acids 1-20

N-myristoylation Sites:

amino acids 67-72, 118-123, 163-168

Flavodoxin protein homology:

amino acids 156-174
140/330
FIGURE 140 

CATTTCTGAAACTAATCGTGTCAGAATTGACTTTGAAAAGCATTGCTTTTTACAGAAGTATA 
TTAACTTTTTAGGAGTAATTTCTAGTTTGGATTGTAATATGAAATAATTTAAAAGGGCTTCG 
CTCATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTCTTATTGCTTACTGATTTTTT 
TGAGTTAAGAGTTGTTATATGCTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAA 
GAATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATG 
CAAGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGT 
TTGTTCGATTTCAACCAGAGACTATAGCATGTGCTTGCATCTACCTTGCAGCTAGAGCACTT 
CAGATTCCGTTGCCAACTCGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAAT 
CCAGGAAATCTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTAC
TGGAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCAAATTAAAAGCAAAGGGA 
TTGAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCCAGCCTCCAAGCC 
ATCATCACCAAGAGAAGTAAAAGCTGAAGAGAAATCACCAATCTCCATTAATGTGAAGACAG 
TCAAAAAAGAACCTGAGGATAGACAACAGGCTTCCAAAAGCCCTTACAATGGTGTAAGAAAA 
GACAGCAAGAGAAGTAGAAATAGCAGAAGTGCAAGTCGATCGAGGTCAAGAACACGATCACG 
TTCTAGATCACATACTCCAAGAAGACACTATAATAATAGGCGGAGTCGATCTGGAACATACA 
GCTCGAGATCAAGAAGCAGGTCCCGCAGTCACAGTGAAAGCCCTCGAAGACATCATAATCAT 
GGTTCTCCTCACCTTAAGGCCAAGCATACCAGAGATGATTTAAAAAGTTCAAACAGACATGG 
TCATAAAAGGAAAAAATCTCGTTCTCGATCTCAGAGCAAGTCTCGGGATCACTCAGATGCAG 
CCAAGAAACACAGGCATGAAAGGGGACATCATAGGGACAGGCGTGAACGATCTCGCTCCTTT 
GAGAGGTCCCATAAAAGCAAGCACCATGGTGGCAGTCGCTCAGGACATGGCAGGCACAGGCG 
CTGACTTTCTCTTCCTTTGAGCCTGCATCAGTTCTTGGTTTTGCCTATCTACAGTGTGATGT 
ATGGACTCAATCAAAAACATTAAACGCAAACTGATTAGGATTTGATTTCTTGAAACCCTCTA 
GGTCTCTAGAACACTGAGGACAGTTTCTTTTGAAAAGAACTATGTTAATTTTTTTGCACATT 
AAAATGCCCTAGCAGTATCTAATTAAAAACCATGGTCAGGTTCAATTGTACTTTATTATAGT
TGTGTATTGTTTATTGCTATAAGAACTGGAGCGTGAATTCTGTAAAAATGTATCTTATTTTT 
ATACAGATAAAATTGCAGACACTGTTCTATTTAAGTGGTTATTTGTTTAAATGATGGTGAAT 
ACTTTCTTAACACTGGTTTGTCTGCATGTGTAAAGATTTTTACAAGGAAATAAAATACAAAT 
CTTGTTTTTTCTAAAAAAAAAAAAAAAAAAGT 
141/330
FIGURE 141 

MNDSLRTNVFVRFQPETIACACIYLAARALQIPLPTRPHWFLLFGTTEEEIQEICIETLRLY 
TRKKPNYELLEKEVEKRKVALQEAKLKAKGLNPDGTPALSTLGGFSPASKPSSPREVKAEEK 
SPISINVKTVKKEPEDRQQASKSPYNGVRKDSKRSRNSRSASRSRSRTRSRSRSHTPRRHYN 
NRRSRSGTYSSRSRSRSRSHSESPRRHHNHGSPHLKAKHTRDDLKSSNRHGHKRKKSRSRSQ 
SKSRDHSDAAKKHRHERGHHRDRRERSRSFERSHKSKHHGGSRSGHGRHRR 
142/330
FIGURE 142 

TGGGGATAAAGGAAAAATGGTCAGGTATTAATGGCTTAAAGATTATTGGAAGGGGTTTATCA 
TTTTTTGAANNTATTCGGGTCANAATTGNCTTTGAAAAGCATTGCTTTTTACAGAAATATAT 
TANCTTTTTAGAGTAATTTCTAGTTTGGATTGTAATATGAAATTATTTAAAAGGGCTTCGCT 
CATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTNTTATTGCTTACTGATTTTTTTG 
AGTTAAGAGTTGTTATATGNTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAAGA 
ATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATGCA 
AGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGTTT 
GTTCGATTTCAACCAGAGANTATAGCATGTGCTTGCATCTACCTTGCAGNTAGAGCACTTCA 
GATTCCGTTGCCAACTNGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAATCC
AGGAAATNTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTACTG 
GAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCNAATTAAAAGCAAAGGGATT
GAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCC 
143/330
FIGURE 143 

GGCACGAGGCCTCGTGCCrtAGCTTGGCACGAGGGTGCACCGCGTTCTCGCACGCGTCATGGC 
GGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCACCCTCATGCACAGGCTGGCGC 
CACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTTTGTTCCGATACAAGCACCCG 
TCTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCCCAGAGGCAGGAAAGAGCGGTG 
GGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCCGAGATGCCCCGTTCCAGCTGG 
AGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGCTTCTTCCTGGAGTACCAGTGG 
TTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTTCACAGAGGCCTACTACTACAT 
GCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGTGCCTGCTCACGGTGACCTTCT 
CCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGCGCCGAGGAGGGGGGTGAGCGC 
TCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGCCATGCTGGTGCAAGTGGTGCG 
GGAGGAGACCCTCGAGCTGGGCCTGGAGCCTGGTCTGGCCAGCATGACCCAGAACTTAGAGC 
CACTTCTGAAGAAGCAGGGCTGGGACTGGGCGCTTCCTGTGGCCAAGCTGGCTATCCGCGTG 
GGACTGGCAGTGGTGGGCTCTGTGCTGGGTGCCTTCCTCACCTTCCCAGGCCTGCGGCTGGC 
CCAGACCCACCGGGACGCACTGACCATGTCGGAGGACAGACCCATGCTGCAGTTCCTCCTGC 
ACACCAGCTTCCTGTCTCCCCTGTTCATCCTGTGGCTCTGGACAAAGCCCATTGCACGGGAC 
TTCCTGCACCAGCCGCCGTTTGGGGAGACGCGTTTCTCCCTGCTGTCCGATTCTGCCTTCGA 
CTCTGGGCGCCTCTGGTTGCTGGTGGTGCTGTGCCTGCTGCGGCTGGCGGTGACCCGGCCCC 
ACCTGCAGGCCTACCTGTGCCTGGCCAAGGCCCGGGTGGAGCAGCTGCGAAGGGAGGCTGGC 
CGCATCGAAGCCCGTGAAATCCAGCAGAGGGTGGTCCGAGTCTACTGCTATGTGACCGTGGT 
GAGCTTGCAGTACCTGACGCCGCTCATCCTCACCCTCAACTGCACACTTCTGCTCAAGACGC 
TGGGAGGCTATTCCTGGGGCCTGGGCCCAGCTCCTCTACTATCCCCCGACCCATCCTCAGCC 
AGCGCTGCCCCCATCGGCTCTGGGGAGGACGAAGTCCAGCAGACTGCAGCGCGGATTGCCGG 
GGCCCTGGGTGGCCTGCTTACTCCCCTCTTCCTCCGTGGCGTCCTGGCCTACCTCATCTGGT 
GGACGGCTGCCTGCCAGCTGCTCGCCAGCCTTTTCGGCCTCTACTTCCACCAGCACTTGGCA 
GGCTCCTAGCTGCCTGCAGACCCTCCTGGGGCCCTGAGGTCTGTTCCTGGGGCAGCGGGACA 
CTAGCCTGCCCCCTCTGTTTGCGCCCCCGTGTCCCCAGCTGCAAGGTGGGGCCGGACTCCCC 
GGCGTTCCCTTCACCACAGTGCCTGACCCGCGGCCCCCCTTGGACGCCGAGTTTCTGCCTCA 
GAACTGTCTCTCCTGGGCCCAGCAGCATGAGGGTCCCGAGGCCATTGTCTCCGAAGCGTATG 
TGCCAGGTTTGAGTGGCGAGGGTGATGCTGGCTGCTCTTCTGAACAAATAAAGGAGCATGCC 
GATTTTTAA 
144/330
FIGURE 144 

MAVLGVQLVVTLLTATLMHRLAPHCSFARWLLCNGSLFRYKHPSEEELRALAGKPRPRGRKE
RWANGLSEEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVDFAVYSGGVYLFTEAYY 
YMLGPAKETNIAVFWCLLTVTFSIKMFLTVTRLYFSAEEGGERSVCLTFAFLFLLLAMLVQV 
VREETLELGLEPGLASMTQNLEPLLKKQGWDWALPVAKLAIRVGLAVVGSVLGAFLTFPGLR 
LAQTHRDALTMSEDRPMLQFLLHTSFLSPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSA 
FDSGRLWLLVVLCLLRLAVTRPHLQAYLCLAKARVEQLRREAGRIEAREIQQRVVRVYCYVT
VVSLQYLTPLILTLNCTLLLKTLGGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARI
AGALGGLLTPLFLRGVLAYLIWWTAACQLLASLFGLYFHQHLAGS 
145/330
FIGURE 145 

CGTTNGCACGCGTCAATGGCGGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCAC 
CCTCATGCACAGGCTGGCGCCACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTT 
TGTTCCGATACAAGCACCCGTNTTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCC 
CAGAGGCAGGAAAGAGCGGTGGGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCC 
GAGATGCCCCGTTCCAGCTGGAGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGC 
TTCTTCCTGGAGTACCAGTGGTTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTT 
CACAGAGGCCTACTACTACATGCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGT 
GCCTGCTCACAGTGACCTTCTCCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGC 
GCCGAGGAGGGGGGTGAGCGCTCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGC 
CATGCTGGTGCAAGGG 
146/330
FIGURE 146

GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTT 
AATCTGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGC 
TTGGCTGGTTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGG 
CGCTTCTGTTGCTGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTG 
TATTCAGAACTCTGTAAAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTG 
TCCAGACGGCTGTGCGAGCCTCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAA 
TGACAGACGAGCCTGGCCTAGACAACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCA 
GTGGACTCTGGCCGGAGCAACCGAACTAGGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAA 
AAAAATAAATCGAGCTTTGAGTGTTCTTCGAAGGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGG 
GCAGGGAAAATTCTGAAAACACCACTGCCCCTGAAGTCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAA 
ATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAAGCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAAC 
CCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGGTGATCGCCAGAGACGGCCGGCTACTGCCAG 
GAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTCACAACTACGCTGTGCGTCTCCTGCGG 
CAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCAGCAGGAACAATGGACAGGCCCC 
GGATGCCTACAGACCCCGAGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCCCCGAGGAGCAGCTTGGAA 
TAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCGGTGTGGCATATCGA 
CATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCAGCCCAGAAAG 
TGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGCGGAGCC 
CTGACATCTTTCAGGAAGCCGGCTGGAACAGCAATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACACT 
CCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCT 
CGGCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCG 
GAGGAGTCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGTCGAACTGACA 
GAGGTCAGCCGGAGTGAGGCAGTGGCATTATTGAAAAGAACATCATCCTCGATAGTACTCAAAGCTTTGGAAGT 
CAAAGAGTATGAGCCCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCA 
GTGACTGGTCCCCATCCTGGGTCATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTA 
CGAAGAAACACAGCTGGAAGTCTGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCTTT 
TTTCATCAAATCCATTGTTGAAGGAACACCAGCATACAATGATGGAAGAATTAGATGTGGTGATATTCTTCTTG 
CTGTCAATGGTAGAAGTACATCAGGAATGATACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGA 
ATTACTCTAACTATTGTTTCTTGGCCTGGCACTTTTTTATAGAATCAATGATGGGTCAGAGGAAAACAGAAAAA 
TCACAAATAGGCTAAGAAGTTGAAACACTATATTTATCTTGTCAGTTTTTATATTTAAAGAAAGAATACATTGT 
AAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTTACACCTCAGAAAATATGATTCCAAAAAAATTA 
AAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTACAACATTGTTTATATTTTTTCTATTCAAT 
AAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATTCAAGCTGATTTAAATTTAAAATTT 
GGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCATTTTTAATTTACAGCTAAAATATTTTTTAAAATGCA 
TTGCTGAGAAACGTTGCTTTCATCAAACAAGAATAAATATTTTTCAGAAGTTAAA 



147/330
FIGURE 147 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTAT
APSPEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRS 
FKKINRALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDP 
SESLSIRLVGGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLL 
RQPCQVLWLTVMREQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGV 
FIFNVLDGGVAYRHGQLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLVVSRQVRQRS
PDIFQEAGWNSNGSWSPGPGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHRE
WDLPIYVISVEPGGVISRDGRIKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEV 
KEYEPQEDCSSPAALDSNHNMAPPSDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIV 
GGYEEYNGNKPFFIKSIVEGTPAYNDGRIRCGDILLAVNGRSTSGMIHACLARLLKELKGRI 
TLTIVSWPGTFL 
148/330
FIGURE 148 

CCAAAGTGATCATTTGAfiAAAGAGATATCCACATCTTCAAGCCCATATAAAGGATAGAAGCT 
GCACAGGGCAGCTTTACTTACTCCAGCACCTTCCTCTCCCAGGCAAATGGTGCTGACCATCT 
TTGGGATACAATCTCATGGATACGAGGTTTTTAACATCATCAGCCCAAGCAACAATGGTGGC 
AATGTTCAGGAGACAGTGACAATTGATAATGAAAAAAATACCGCCATCGTTAACATCCATGC 
AGGATCATGCTCTTCTACCACAATTTTTGACTATAAACATGGCTACATTGCATCCAGGGTGC 
TCTCCCGAAGAGCCTGCTTTATCCTGAAGATGGACCATCAGAACATCCCTCCTCTGAACAAT 
CTCCAATGGTACATCTATGAGAAACAGGCTCTGGACAACATGTTCTCCAACAAATACACCTG 
GGTCAAGTACAACCCTCTGGAGTCTCTGATCAAAGACGTGGATTGGTTCCTGCTTGGGTCAC 
CCATTGAGAAACTCTGCAAACATATCCCTTTGTATAAGGGGGAAGTGGTTGAAAACACACAT
AATGTCGGTGCTGGAGGCTGTGCAAAGGCTGGGCTCCTGGGCATCTTGGGAATTTCAATCTG 
TGCAGACATTCATGTTTAGGATGATTAGCCCTCTTGTTTTATCTTTTCAAAGAAATACATCC 
TTGGTTTACACTCAAAAGTCAAATTAAATTCTTTCCCAATGCCCCAACTAATTTTGAGATTC 
AGTCAGAAAATATAAATGCTGTATTTATA 
149/330
FIGURE 149 

MKILVAFLVVLTIFGIQSHGYEVFNIISPSNNGGNVQETVTIDNEKNTAIVNIHAGSCSSTT
IFDYKHGYIASRVLSRRACFILKMDHQNIPPLNNLQWYIYEKQALDNMFSNKYTWVKYNPLE 
SLIKDVDWFLLGSPIEKLCKHIPLYKGEVVENTHNVGAGGCAKAGLLGILGISICADIHV



150/330
FIGURE 150 

GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAGGC
ATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGGCACG 
GGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCCCCAAAG 
GCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCTATTCCCTC 
TGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGCCGGCCTCCTT 
CAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCTGCCGGGCGTCCT 
CCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGGAGCTGTGGTCCAAG 
CCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGGGCAGGCCCCAGGGTGGA 
GATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAACAGCCTGATCGGGAAGGATG 
GGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCTGCCAACTTCTCCTTCCTGCCG 
AGCCAGACATCGGACTGGTTCTGGTGCCAGGCTGCAAACAACGCCAATGTCCAGCACAGCGC
CCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTGGAGA 
GCCCCATCCTTGCCTTGCCGCTCTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGG 
GGGTTCAGGATAGGGAATGGGGAGGTCAGAGGACGCAAAGCAGCAGCCATGTAGAATGAACC 
GTCCAGAGAGCCAAGCACGGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGA 
GTTCATGCAAAATGAGTGTGTTTTAGCTGCTCTTGCCACAAAAAAAAATVAAAAAAAAAAAAA 
151/330

FIGURE 151 

MGLPGLFCLAVLAASSFSKAREEEITPVVSIAYKVLEVFPKGRWVLITCCAPQPPPPITYSL
CGTKNIKVAKKVVKTHEPASFNLNVTLKSSPDLLTYFCRASSTSGAHVDSARLQMHWELWSK 
PVSELRANFTLQDRGAGPRVEMICQASSGSPPITNSLIGKDGQVHLQQRPCHRQPANFSFLP 
SQTSDWFWCQAANNANVQHSALTVVPPGGDQKMEDWQGPLESPILALPLYRSTRRLSEEEFG 
GFRIGNGEVRGRKAAAM 

Signal Peptide: 

amino acids 1-18 

N-glycosylation Sites: 

amino acids 86-89, 132-135, 181-184 
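
The N-glycosylation sites listed above can be located by scanning the Figure 151 sequence for the consensus motif N-X-S/T. The following is a minimal sketch that assumes the PROSITE-style form of the motif, N-{P}-[ST]-{P}, in which neither X nor the residue after S/T is proline. The function name `n_glycosylation_sites` is hypothetical, and the first line of the sequence is given in the 62-residue reading implied by the coding sequence of Figure 150.

```python
import re

# Figure 151 polypeptide, one string per printed line.
FIG151 = (
    "MGLPGLFCLAVLAASSFSKAREEEITPVVSIAYKVLEVFPKGRWVLITCCAPQPPPPITYSL"
    "CGTKNIKVAKKVVKTHEPASFNLNVTLKSSPDLLTYFCRASSTSGAHVDSARLQMHWELWSK"
    "PVSELRANFTLQDRGAGPRVEMICQASSGSPPITNSLIGKDGQVHLQQRPCHRQPANFSFLP"
    "SQTSDWFWCQAANNANVQHSALTVVPPGGDQKMEDWQGPLESPILALPLYRSTRRLSEEEFG"
    "GFRIGNGEVRGRKAAAM"
)

# N, then any residue but P, then S or T, then any residue but P.
# The lookahead lets overlapping candidate sites be reported independently.
MOTIF = re.compile(r"N(?=[^P][ST][^P])")


def n_glycosylation_sites(seq: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) ranges of four-residue motif hits."""
    return [(m.start() + 1, m.start() + 4) for m in MOTIF.finditer(seq)]


print(n_glycosylation_sites(FIG151))
```

Under these assumptions the scan reports the ranges 86-89, 132-135 and 181-184, in agreement with the sites listed above.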
152/330
FIGURE 152

GGTCCTTAATGGCAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTG 
CTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGT 
CATCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGA 
CTTTTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGGGGAAGAAA 
CTAAATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGGTGGACATACT 
TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCTGC 
AGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGT 
TTCGATGGGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGTTCATCC
TGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTCCTTCCATT 
ACTTCTCAATGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAGCACC 
CTGGAGCCAAGTGCAGGAGCACCACTCGCCATGTCCTCAGGCACAACCCAACTCAGGGCCAC 
AGCCACCACCCTCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCA
TCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCTGATACCAAAAGGCTCCTGTGAGCACG 
GTCTTGATCAAACTCGCCCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGT 
GGCCTCCAGCAGATCATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTT 
TTGCCAACAATTTTACCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACC 
TGATGGAATTCCTGCACTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTC 
TCTTTTTGTTTGGAAAATCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATT 
GTCAGTAAAATAATCACGTTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAG
AATTTTTAAATTATTTAATAAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTAT 
TGTTCTGTACTGATATTTAAATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
153/330
FIGURE 153 

MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYDITVIPKFRPGPRWCAVQGQVDEKTFL
HYDCGNKTVTPVSPLGKKLNVTTAWKAQNPVLREVVDILTEQLRDIQLENYTPKEPLTLQAR
MSCEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVHPGARKMKEKWENDKVVAMSFHYFS
MGDCIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILPCFILPGI 

Important features:
Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 224-246 

N-glycosylation site.

amino acids 68-72, 82-86 

N-myristoylation site. 

amino acids 200-206, 210-216 



Amidation site. 

amino acids 77-81 
154/330
FIGURE 154 

GGGAAAGCCATTTCGAAAACCCATCTATACAAACTATATATTTTCATTTCTGCTGCTAGCTG 
CCTTGGGCCTCACAATTTTCATTCTGTTTTCTGACTTTCAAGTTATATACCGTGGAATGGAG 
TTGATCCCAACCATAACATCGTGGAGGGTTTTAATTTTGGTGGTAGCCCTCACCCAATTCTG 
GTGTGGCTTTCTTTGCAGAGGATTCCACCTTCAAAATCATGAACTCTGGCTGTTGATCAAAA 
GAGAATTTGGATTCTACTCTAAAAGTCAATATAGGACTTGGCAAAAGAAGCTAGCAGAAGAC 
TCAACCTGGCCTCCCATAAACAGGACAGATTATTCAGGTGATGGCAAAAATGGATTCTACAT 
CAACGGAGGCTATGAAAGCCATGAACAGATTCCAAAAAGAAAACTCAAATTGGGAGGCCAAC 
CCACAGAACAGCATTTCTGGGCCAGGCTGTAATCAGAATTGTCGTCGTACATGCTCAACAGC
ATTGCTTTTTTCCCCAAAATTAACACATTGTGGAGAAGTGATGATACTCTCCCCTTACCTTT 
CCTCTCTCCATTCAAGCATTCAAAGTATATTTTCAATGAATTAAACCTTGCAGCAAGGGACC 
TTAGATAGGCTTATTCTGACTGTATGCTTTACCAATGAGAGAAAAAAATGCATTTCCTGTAT 
CATCCTTTTCAATAAACTGTATTCATTTTGAAAAAAAAAAAAAAAAAAAAAAA 
155/330
FIGURE 155 

MELIPTITSWRVLILVVALTQFWCGFLCRGFHLQNHELWLLIKREFGFYSKSQYRTWQKKLA
EDSTWPPINRTDYSGDGKNGFYINGGYESHEQIPKRKLKLGGQPTEQHFWARL 
156/330
FIGURE 156 

GTTCTCCTTTCCGAGCCAAAATCCCAGGCGATGGTGAATTATGAACGTGCCACACCATGAAG
CTCTTGTGGCAGGTAACTGTGCACCACCACACCTGGAATGCCATCCTGCTCCCGTTCGTCTA 
CCTCACGGCGCAAGTGTGGATTCTGTGTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCC 
AGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCAGCAAGGTGGTGTGCACGCGCCGG 
GGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACCTCATGGAGAA 
CAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGT 
TGGGCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAAC 
ACCCTGGAGCTGTTCGACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTC 
CAAGCTGCGGGAGCTCTGGCTTCGCAACAACCCCATCGAAAGCATCCCCTCTTACGCCTTCA 
ACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCAAGAAGCTGGAGTATATCTCT 
GAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTGCAACATTAA 
AGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACT 
TCCCTGAGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATG 
AACTCACAGGTCAGCCTGATTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACT 
CAACTTGGCCCACAATAACCTCTCTTCTTTGCCCCATGACCTCTTTACCCCGCTGAGGTACC 
TGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTGACATTCTGTGGCTAGCC 
TGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTCCCAT 
GCACATGCGAGGCCGCTACCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCT 
TCATCATGGACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGT 
CGGACTCCCCCTATGTCCTCCGTGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGC 
CTCCCGCCACCCAAGGATCTCTGTCCTCAACGACGGCACCTTGAACTTTTCCCACGTGCTGC 
TTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCAACTCCAACGCCTCG 
GCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAGT 
AACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCTA 
CCACGTCCACTGGTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACC 
CGTGTGCCCAAGCAGGTGGCAGTACCCGCGACAGACACCACTGACAAGATGCAGACCAGCCT 
GGATGAAGTCATGAAGACCACCAAGATCATCATTGGCTGCTTTGTGGCAGTGACTCTGCTAG 
CTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGCGGAGTACAGTC 
ACAGCCGCCCGGACTGTTGAGATAATCCAGGTGGACGAAGACATCCCAGCAGCAACATCCGC 
AGCAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTC 
ATGACCATATTAACTACAACACCTACAAACCAGCACATGGGGCCCACTGGACAGAAAACAGC 
CTGGGGAACTCTCTGCACCCCACAGTCACCACTATCTCTGAACCTTATATAATTCAGACCCA 
TACCAAGGACAAGGTACAGGAAACTCAAATATGACTCCCCTCCCCCAAAAAACTTATAAAAT 
GCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTTTTCTTGTA
TATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGA 
CAAAAAGTCAAAACA 
157/330
FIGURE 157 

MKLLWQVTVHHHTWNAILLPFVYLTAQVWILCAAIAAAASAGPQNCPSVCSCSNQFSKVVCT
RRGLSEVPQGIPSNTRYLNLMENNIQMIQADTFRHLHHLEVLQLGRNSIRQIEVGAFNGLAS 
LNTLELFDNWLTVIPSGAFEYLSKLRELWLRNNPIESIPSYAFNRVPSLMRLDLGELKKLEY 
ISEGAFEGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLW 
VMNSQVSLIERNAFDGLASLVELNLAHNNLSSLPHDLFTPLRYLVELHLHHNPWNCDCDILW 
LAWWLREYIPTNSTCCGRCHAPMHMRGRYLVEVDQASFQCSAPFIMDAPRDLNISEGRMAEL 
KCRTPPMSSVKWLLPNGTVLSHASRHPRISVLNDGTLNFSHVLLSDTGVYTCMVTNVAGNSN 
ASAYLNVSTAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPAYTTSTTVLIQ 
TTRVPKQVAVPATDTTDKMQTSLDEVMKTTKIIIGCFVAVTLLAAAMLIVFYKLRKRHQQRS 
TVTAARTVEIIQVDEDIPAATSAAATAAPSGVSGEGAVVLPTIHDHINYNTYKPAHGAHWTE 
NSLGNSLHPTVTTISEPYIIQTHTKDKVQETQI 
158/330
FIGURE 158

CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCC 
TTCTCATCTCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATG 
TGTCGGGAGTGCTGTGAATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCAT 
CCCTTGCTGCAGGAATGAGGAGAATGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACT 
GCAAGAGCTGCCGAAATGGCTCATGGGGGGGTACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAG 
TGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTGGCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTT 
GTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGCTAAACCTGGGTTTGTCATCCAACTAA 
GATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAGGTTCGTGATGGAGACAAC 
CGCGATGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAGGATCCTCACT 
CCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGCAT 
GCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTG 
GCAGGCTATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCAGACCCTGGGGGCCCAGTCAA 
TGGGTACCAGAAAATAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTT 
TCTTTTGTAACAACTCCTATGTTCTTAGTGGCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGG 
AAACAGCCCATCTGCATAAAAGCCTGCCGAGAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGAT 
GCAGGTTCAGTCAAGGGAGACACCATTACACCAGCTATACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTG 
CCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGGATACCAACATCTGCATACCCAGCTCCAG 
TATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGTCTGAGGACTGGGAAGTGGAG 
TGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAATTGAGAACATCACTGCTCCAAAGACCCAAGGGTTGC 
GCTGGCCGTGGCAGGCAGCCATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGCGTGG 
TTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGG 
GAAGGTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGGGATGATGACCGGGATG 
AGAAGACCATCCAGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCT 
GACATCGCCATCCTGAAGCTCCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAG 
TCGGGATCTCAGCACTTCCTTCCAGGAGTCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGA 
GCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGGTGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGCAG 
CATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTTCTGTGCCAGCTGGGAACCCACTGCCCCTTC 
TGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGACGAGCATCTCCTGAGCCACGCT 
GGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCACTGCCTTCACCAAG 
GTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAGTGTTTC 
TGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGG 
CTGTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGAT 
GCCGCGTCCACTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAGAAGTAAGTTTCTTCAAAGAAGACC 
ATATACAAAACCTCTCCACTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTG
ACCAGGGAAGATCTGGGCTTCATGAGGCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCC 
CAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTTTGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCC 
CCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCTGAACTACAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAA/WVW^AAAAAAAAAAAAAAAAAAAAAAAA 
159/330
FIGURE 159 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREVV
GYTIPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGD 
CMRCGQVLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRD 
GDNRDGQIIKRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDG
TCVLDKAGSYKCACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTV 
VSFFCNNSYVLSGNEKRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLH 
QLYSAAFSKQKLQSAPTKKPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGK
WSGRAPSCIPICGKIENITAPKTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNE 
RTVVVAAHCVTDLGKVTMIKTADLKVVLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLD
ADIAILKLLDKARISTRVQPICLAASRDLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSG 
VVSVVDSLLCEEQHEDHGIPVSVTDNMFCASWEPTAPSDICTAETGGIAAVSFPGRASPEPR
WHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWIERNMK 
160/330
FIGURE 160 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGA 
AGCTTTCTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGC
TTCAACCTGACTTTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCT 
GACAGTGGTTGGGTGGGCCACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAG 
CAAAGGAGTTCATGGCTAATTTCCATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACT 
AATGAAGCATCCACGAAGAAGGTAGAACTTGACAACTGTCCTTCTGTGTCTCCTTACCTCAG 
AGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCACTTTGGAAGAGGTACAGGCAGAAAATC 
CCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAGCTTTACAGAGGGTCGCCATC 
CTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAACATCTGCATCCCTT 
CCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGGTAAAAAGT 
TTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGGAC 
TGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGA 
GGAGCATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTG 
GATATTTTGGGGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCT
AACAACTACTGGGGATGGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAG 
AATGAAAATTTCCCGGCCCCTGCCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAG 
ACAAAGGCAATGAGGTGAACGCAGAACGGATGAAGCTCTTACACCAAGTGTCACGAGTCTGG 
AGAACAGATGGGTTGAGTAGTTGTTCTTATAAATTAGTATCTGTGGAACACAATCCTTTATA 
TATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCCTGGATCTTTTGGTGATGTTTGG 
AAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAAATAGTAGCACACATTA 
AGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCTTAGCAGAGCT 
CCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGATCATG 
AGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGAT 
AAAATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATAT 
TATGGGATAAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCT 
CGTCCAAGGTAGAAAGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCT 
GTGAAGTGGTGGTGTCAGGTGAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCA 
GGACACAGTGAACTTGGGAATGAAGAGGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAG 
CAGTAGCTGAGCTGGTTGCAGGTGCTGATAGCCTTCAGGGGAGGACCTGCCCAGGTATGCCT 
TCCAGTGATGCCCACCAGAGMTACATTCTCTATTAGTTTTTAAAGAGTTTTTGTAAAATGA 
TTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACATATTAACTAATAATAAATA 
TGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
161/330
FIGURE 161 

MGFNLTFHLSYKFRLLLLLTLCLTVVGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKT
LTNEASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRV 
AILVPHRNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEEN 
WDCFIFHDVDLVPENDFNLYKCEEHPKHLVVGRNSTGYRLRYSGYFGGVTALSREQFFKVNG 
FSNNYWGWGGEDDDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSR 
VWRTDGLSSCSYKLVSVEHNPLYINITVDFWFGA 

Important features:
Signal peptide:

amino acids 1-27

N-glycosylation sites:

amino acids 4-7, 220-223 and 335-338

Xylose isomerase proteins:

amino acids 191-201
162/330
FIGURE 162 

CGTGGGCCGGGGTCGCGCAGCGGGCTGTGGGCGCGCCCGGAGGAGCGACCGCCGCAGTTCTC 
GAGCTCCAGCTGCATTCCCTCCGCGTCCGCCCCACGCTTCTCCCGCTCCGGGCCCCGCAMG 
GCCCAGGCAGTGTGGTCGCGCCTCGGCCGCATCCTCTGGCTTGCCTGCCTCCTGCCCTGGGC 
CCCGGCAGGGGTGGCCGCAGGCCTGTATGAACTCAATCTCACCACCGATAGCCCTGCCACCA 
CGGGAGCGGTGGTGACCATCTCGGCCAGCCTGGTGGCCAAGGACAACGGCAGCCTGGCCCTG 
CCCGCTGACGCCCACCTCTACCGCTTCCACTGGATCCACACCCCGCTGGTGCTTACTGGCAA 
GATGGAGAAGGGTCTCAGCTCCACCATCCGTGTGGTCGGCCACGTGCCCGGGGAATTCCCGG 
TCTCTGTCTGGGTCACTGCCGCTGACTGCTGGATGTGCCAGCCTGTGGCCAGGGGCTTTGTG 
GTCCTCCCCATCACAGAGTTCCTCGTGGGGGACCTTGTTGTCACCCAGAACACTTCCCTACC 
CTGGCCCAGCTCCTATCTCACTAAGACCGTCCTGAAAGTCTCCTTCCTCCTCCACGACCCGA 
GCAACTTCCTCAAGACCGCCTTGTTTCTCTACAGCTGGGACTTCGGGGACGGGACCCAGATG 
GTGACTGAAGACTCCGTGGTCTATTATAACTATTCCATCATCGGGACCTTCACCGTGAAGCT 
CAAAGTGGTGGCGGAGTGGGAAGAGGTGGAGCCGGATGCCACGAGGGCTGTGAAGCAGAAGA 
CCGGGGACTTCTCCGCCTCGCTGAAGCTGCAGGAAACCCTTCGAGGCATCCAAGTGTTGGGG 
CCCACCCTAATTCAGACCTTCCAAAAGATGACCGTGACCTTGAACTTCCTGGGGAGCCCTCC 
TCTGACTGTGTGCTGGCGTCTCAAGCCTGAGTGCCTCCCGCTGGAGGAAGGGGAGTGCCACC 
CTGTGTCCGTGGCCAGCACAGCGTACAACCTGACCCACACCTTCAGGGACCCTGGGGACTAC 
TGCTTCAGCATCCGGGCCGAGAATATCATCAGCAAGACACATCAGTACCACAAGATCCAGGT 
GTGGCCCTCCAGAATCCAGCCGGCTGTCTTTGCTTTCCCATGTGCTACACTTATCACTGTGA 
TGTTGGCCTTCATCATGTACATGACCCTGCGGAATGCCACTCAGCAAAAGGACATGGTGGAG 
AACCCGGAGCCACCCTCTGGGGTCAGGTGCTGCTGCCAGATGTGCTGTGGGCCTTTCTTGCT 
GGAGACTCCATCTGAGTACCTGGAAATTGTTCGTGAGAACCACGGGCTGCTCCCGCCCCTCT 
ATAAGTCTGTCAAAACTTACACCGTGTGAGCACTCCCCCTCCCCACCCCATCTCAGTGTTAA 
CTGACTGCTGACTTGGAGTTTCCAGCAGGGTGGTGTGCACCACTGACCAGGAGGGGTTCATT 
TGCGTGGGGCTGTTGGCCTGGATCATCCATCCATCTGTACAGTTCAGCCACTGCCACAAGCC 
CCTCCCTCTCTGTCACCCCTGACCCCAGCCATTCACCCATCTGTACAGTCCAGCCACTGACA 
TAAGCCCCACTCGGTTACCACCCCCTTGACCCCCTACCTTTGAAGAGGCTTCGTGCAGGACT 
TTGATGCTTGGGGTGTTCCGTGTTGACTCCTAGGTGGGCCTGGCTGCCCACTGCCCATTCCT
CTCATATTGGCACATCTGCTGTCCATTGGGGGTTCTCAGTTTCCTCCCCCAGACAGCCCTAC 
CTGTGCCAGAGAGCTAGAAAGAAGGTCATAAAGGGTTAAAAATCCATAACTAAAGGTTGTAC 
ACATAGATGGGCACACTCACAGAGAGAAGTGTGCATGTACACACACCACACACACACACACA 
CACACACACACAGAAATATAAACACATGCGTCACATGGGCATTTCAGATGATCAGCTCTGTA 
TCTGGTTAAGTCGGTTGCTGGGATGCACCCTGCACTAGAGCTGAAAGGAAATTTGACCTCCA 
AGCAGCCCTGACAGGTTCTGGGCCCGGGCCCTCCCTTTGTGCTTTGTCTCTGCAGTTCTTGC 
GCCCTTTATAAGGCCATCCTAGTCCCTGCTGGCTGGCAGGGGCCTGGATGGGGGGCAGGACT 
AATACTGAGTGATTGCAGAGTGCTTTATAAATATCACCTTATTTTATCGAAACCCATCTGTG 
AAACTTTCACTGAGGAAAAGGCCTTGCAGCGGTAGAAGAGGTTGAGTCAAGGCCGGGCGCGG 
TGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGATCAGGA 
GATCGAGACCACCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAAATACAAAAAGTT 
AGCCGGGCGTGGTGGTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATG 
GTGCGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCAGATGGCGCCACTGCACTCCAGCCTGA 
GTGACAGAGCGAGACTCTGTCTCCA 
163/330
FIGURE 163 

MAQAVWSRLGRILWLACLLPWAPAGVAAGLYELNLTTDSPATTGAVVTISASLVAKDNGSLA
LPADAHLYRFHWIHTPLVLTGKMEKGLSSTIRVVGHVPGEFPVSVWVTAADCWMCQPVARGF
VVLPITEFLVGDLVVTQNTSLPWPSSYLTKTVLKVSFLLHDPSNFLKTALFLYSWDFGDGTQ
MVTEDSVVYYNYSIIGTFTVKLKVVAEWEEVEPDATRAVKQKTGDFSASLKLQETLRGIQVL
GPTLIQTFQKMTVTLNFLGSPPLTVCWRLKPECLPLEEGECHPVSVASTAYNLTHTFRDPGD
YCFSIRAENIISKTHQYHKIQVWPSRIQPAVFAFPCATLITVMLAFIMYMTLRNATQQKDMV
ENPEPPSGVRCCCQMCCGPFLLETPSEYLEIVRENHGLLPPLYKSVKTYTV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domain: 

amino acids 339-362 

N-glycosylation sites:

amino acids 34-37, 58-61, 142-145, 197-200, 300-303 and 364-367 
164/330
FIGURE 164 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCT
GGGCCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCA 
CAACAGACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTG 
GATGCCCATGTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCT 
GCGGCTGCTGTCATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTG 
CCCCCGTCCCCTCCCTTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAA 
TGGCTGGTTCTTTTGTTTTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
165/330
FIGURE 165 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTH 
FPICIFCCGCCHRSKCGMCCKT 
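
The polypeptide of FIGURE 165 can be cross-checked against the nucleotide sequence of FIGURE 164 by conceptual translation. The sketch below assumes the standard genetic code; the helper name `translate` is hypothetical and not part of the source. It is applied to the 3' end of the FIGURE 164 coding region.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna, start=0):
    """Translate dna from offset start, stopping at the first stop codon."""
    residues = []
    for pos in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3].upper()]
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(aa)
    return "".join(residues)

# Tail of the FIGURE 164 coding region, in frame, including the TAG stop codon.
tail = "AAGTGTGGGATGTGCTGCAAGACGTAGAACC"
print(translate(tail))  # KCGMCCKT
```

The output agrees with the final residues of FIGURE 165. The same check can expose extraction errors elsewhere in the listing, where a garbled nucleotide or residue breaks the agreement.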
166/330
FIGURE 166 

CTGTCAGGAAGGACCATCTGAAGGCTGCAATTTGTTCTTAGGGAGGCAGGTGCTGGCCTGGC
CTGGATCTTCCACCATGTTCCTGTTGCTGCCTTTTGATAGCCTGATTGTCAACCTTCTGGGC 
ATCTCCCTGACTGTCCTCTTCACCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGG 
AGTCTCCTTTGGTATCCGCAAACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTA 
CCTTGAGAATGGAGCGAGGAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAAC 
GGAATCATTGCAAAGGATCCCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGG 
TAGTAGTAAGGCTCTGGACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCC 
GGAAAGGAATGGAGACCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTG 
GAGTCCTGGAACCTGCTGAGCAGAACCAATTATAACTTCCAGTACATCAGCCTTCGGCTCAC 
GGTCCTGTGGGGGTTAGGAGTGCTGATTCGGTACTGCTTTCTGCTGCCGCTCAGGATAGCAC 
TGGCTTTCACAGGGATTAGCCTTCTGGTGGTGGGCACAACTGTGGTGGGATACTTGCCAAAT 
GGGAGGTTTAAGGAATTCATGAGTAAACATGTTCACTTAATGTGTTACCGGATCTGCGTGCG 
AGCGCTGACAGCCATCATCACCTACCATGACAGGGAAAACAGACCAAGAAATGGTGGCATCT 
GTGTGGCCAATCATACCTCACCGATCGATGTGATCATCTTGGCCAGCGATGGCTATTATGCC 
ATGGTGGGTCAAGTGCACGGGGGACTCATGGGTGTGATTCAGAGAGCCATGGTGAAGGCCTG 
CCCACACGTCTGGTTTGAGCGCTCGGAAGTGAAGGATCGCCACCTGGTGGCTAAGAGACTGA 
CTGAACATGTGCAAGATAAAAGCAAGCTGCCTATCCTCATCTTCCCAGAAGGAACCTGCATC 
AATAATACATCGGTGATGATGTTCAAAAAGGGAAGTTTTGAAATTGGAGCCACAGTTTACCC 
TGTTGCTATCAAGTATGACCCTCAATTTGGCGATGCCTTCTGGAACAGCAGCAAATACGGGA 
TGGTGACGTACCTGCTGCGAATGATGACCAGCTGGGCCATTGTCTGCAGCGTGTGGTACCTG 
CCTCCCATGACTAGAGAGGCAGATGAAGATGCTGTCCAGTTTGCGAATAGGGTGAAATCTGC 
CATTGCCAGGCAGGGAGGACTTGTGGACCTGCTGTGGGATGGGGGCCTGAAGAGGGAGAAGG 
TGAAGGACACGTTCAAGGAGGAGCAGCAGAAGCTGTACAGCAAGATGATCGTGGGGAACCAC 
AAGGACAGGAGCCGCTCCTGAGCCTGCCTCCAGCTGGCTGGGGCCACCGTGCGGGGTGCCAA 
CGGGCTCAGAGCTGGAGTTGCCGCCGCCGCCCCCACTGCTGTGTCCTTTCCAGACTCCAGGG 
CTCCCCGGGCTGCTCTGGATCCCAGGACTCCGGCTTTCGCCGAGCCGCAGCGGGATCCCTGT 
GCACCCGGCGCAGCCTACCCTTGGTGGTCTAAACGGATGCTGCTGGGTGTTGCGACCCAGGA 
CGAGATGCCTTGTTTCTTTTACAATAAGTCGTTGGAGGAATGCCATTAAAGTGAACTCCCCA 
CCTTTGCACGCTGTGCGGGCTGAGTGGTTGGGGAGATGTGGCCATGGTCTTGTGCTAGAGAT 
GGCGGTACAAGAGTCTGTTATGCAAGCCCGTGTGCCAGGGATGTGCTGGGGGCGGCCAGCCG 
CTCTCCAGGAAAGGCACAGCTGAGGCACTGTGGCTGGCTTCGGCCTCAACATCGCCCCCAGC 
CTTGGAGCTCTGCAGACATGATAGGAAGGAAACTGTCATCTGCAGGGGCTTTCAGCAAAATG 
AAGGGTTAGATTTTTATGCTGCTGCTGATGGGGTTACTAAAGGGAGGGGAAGAGGCCAGGTG 
GGCCGCTGACTGGGCCATGGGGAGAACGTGTGTTCGTACTCCAGGCTAACCCTGAACTCCCC 
ATGTGATGCGCGCTTTGTTGAATGTGTGTCTCGGTTTCCCCATCTGTAATATGAGTCGGGGG 
GAATGGTGGTGATTCCTACCTCACAGGGCTGTTGTGGGGATTAAAGTGCTGCGGGTGAGTGA 
AGGACACATCACGTTCAGTGTTTCAAGTACAGGCCCACAAAACGGGGCACGGCAGGCCTGAG 
CTCAGAGCTGCTGCACTGGGCTTTGGATTTGTTCTTGTGAGTAAATAAAACTGGCTGGTGAA 
TGA 
167/330
FIGURE 167 

MFLLLPFDSLIVNLLGISLTVLFTLLLVFIIVPAIFGVSFGIRKLYMKSLLKIFAWATLRME
RGAKEKNHQLYKPYTNGIIAKDPTSLEEEIKEIRRSGSSKALDNTPEFELSDIFYFCRKGME 
TIMDDEVTKRFSAEELESWNLLSRTNYNFQYISLRLTVLWGLGVLIRYCFLLPLRIALAFTG 
ISLLVVGTTVVGYLPNGRFKEFMSKHVHLMCYRICVRALTAIITYHDRENRPRNGGICVANH
TSPIDVIILASDGYYAMVGQVHGGLMGVIQRAMVKACPHVWFERSEVKDRHLVAKRLTEHVQ 
DKSKLPILIFPEGTCINNTSVMMFKKGSFEIGATVYPVAIKYDPQFGDAFWNSSKYGMVTYL 
LRMMTSWAIVCSVWYLPPMTREADEDAVQFANRVKSAIARQGGLVDLLWDGGLKREKVKDTF 
KEEQQKLYSKMIVGNHKDRSRS 
168/330
FIGURE 168 

GCCCCTCGAAACCAGGACTCCAGCACCTCTGGTCCCGCCCTCACCCGGACCCCTGGCCCTCA 
CGTCTCCTCCAGGGATGGCGCTGGCGGCTTTGATGATCGCCCTCGGCAGCCTCGGCCTCCAC
ACCTGGCAGGCCCAGGCTGTTCCCACCATCCTGCCCCTGGGCCTGGCTCCAGACACCTTTGA 
CGATACCTATGTGGGTTGTGCAGAGGAGATGGAGGAGAAGGCAGCCCCCCTGCTAAAGGAGG 
AAATGGCCCACCATGCCCTGCTGCGGGAATCCTGGGAGGCAGCCCAGGAGACCTGGGAGGAC 
AAGCGTCGAGGGCTTACCTTGCCCCCTGGCTTCAAAGCCCAGAATGGAATAGCCATTATGGT 
CTACACCAACTCATCGAACACCTTGTACTGGGAGTTGAATCAGGCCGTGCGGACGGGCGGAG 
GCTCCCGGGAGCTCTACATGAGGCACTTTCCCTTCAAGGCCCTGCATTTCTACCTGATCCGG 
GCCCTGCAGCTGCTGCGAGGCAGTGGGGGCTGCAGCAGGGGACCTGGGGAGGTGGTGTTCCG 
AGGTGTGGGCAGCCTTCGCTTTGAACCCAAGAGGCTGGGGGACTCTGTCCGCTTGGGCCAGT 
TTGCCTCCAGCTCCCTGGATAAGGCAGTGGCCCACAGATTTGGGGAGAAGAGGCGGGGCTGT 
GTGTCTGCGCCAGGGGTGCAGCTAGGGTCACAATCTGAGGGGGCCTCCTCTCTGCCCCCCTG 
GAAGACTCTGCTCTTGGCCCCTGGAGAGTTCCAGCTCTCAGGGGTTGGGCCCTGAAAGTCCA 
ACATCTGCCACTTAGGAGCCCTGGGAACGGGTGACCTTCATATGACGAAGAGGCACCTCCAG 
CAGCCTTGAGAAGCAAGAACATGGTTCCGGACCCAGCCCTAGCAGCCTTCTCCCCAACCAGG 
ATGTTGGCCTGGGGAGGCCACAGCAGGGCTGAGGGAACTCTGCTATGTGATGGGGACTTCCT 
GGGACAAGCAAGGAAAGTACTGAGGCAGCCACTTGATTGAACGGTGTTGCAATGTGGAGACA 
TGGAGTTTTATTGAGGTAGCTACGTGATTAAATGGTATTGCAGTGTGGA 
169/330
FIGURE 169

MALAALMIALGSLGLHTWQAQAVPTILPLGLAPDTFDDTYVGCAEEMEEKAAPLLKEEMAHH 
ALLRESWEAAQETWEDKRRGLTLPPGFKAQNGIAIMVYTNSSNTLYWELNQAVRTGGGSREL 
YMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEVVFRGVGSLRFEPKRLGDSVRLGQFASSS 
LDKAVAHRFGEKRRGCVSAPGVQLGSQSEGASSLPPWKTLLLAPGEFQLSGVGP 
170/330
FIGURE 170 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCA 
CCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTG 
GTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTC 
TATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCA 
TAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAG 
CTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACT 
CCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAG 
TCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATG 
GAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTC 
CCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAGAAAGTGATATGACCTTCATCT 
GCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCCATCCTTGCCAGGAAGCTCTGT 
GAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCTGTGTCTCCTGTTGGTGCCCCT 
CCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTCTGAAGAGAGAGAGACAAGAAG 
AGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAAACTCCTAACATATGCCCCCAT 
TCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAATAGAACAATCCTAAAGGAAGA 
TCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGATGGAAAATCCCCACTCAC 
TGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTATCTAGACAGCAGTG 
CACTCCCCTAAGTCTCTGCTCA 
171/330

FIGURE 171 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVT 
IQPEGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHV 
YEHLSKPKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRW 
GESDMTFICVARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLW 
FLKRERQEEYIEEKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIP 
KKMENPHSLLTMPDTPRLFAYENVI
172/330
FIGURE 172 

CTGGTTCCCCAACATGCCTCACCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCC 
TCTGGACCCGTGAAAGAGCTGGTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTC 
CAAAGTAAAGCAAGTTGACTCTATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATAC
AGCCAGAAGGGGGCACTATCATAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCA
GATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGT
GGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACG 
AGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTG 
ACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCT 
GGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAG 
AAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCC 
ATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCT 
GTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTC 
TGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAA 
ACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAA 
TAGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAA 
AGATGGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAG 
AATGTTATCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCAAAAAAAAAAAAAAAAAAA 
173/330
FIGURE 173 

GAAAGACGTGGTCCTGACAGACAGACAATCCTATTCCCTACCAAAATGAAGATGCTGCTGCT 
GCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAAGCTAGTTCTACGGGAAGGA 
ACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGACAAAAGA
GAAAAGATAGAAGAACATGGCAACTTTAGACTTTTTCTGGAGCAAATCCATGTCTTGGAGAA 
TTCCTTAGTTCTTAAAGTCCATACTGTAAGAGATGAAGAGTGCTCCGAATTATCTATGGTTG 
CTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACGTATGATGGATTCAATACATTTACT 
ATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAACGAAAAGGATGGGGA 
AACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGTTCAGACATCAAGGAAA 
GGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAATATCATTGACCTATCCAAT 
GCCAATCGCTGCCTCCAGGCCCGAGAATGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAC 
ACTTCTCACCAGGACTCCACCATCATCCCTTCCTATCCATACAGCATCCCCAGTATAAATTC 
TGTGATCTGCATTCCATCCTGTCTCACTGAGAAGTCCAATTCCAGTCTATCAACATGTTACC 
TAGGATACCTCATCAAGAATCAAAGACTTCTTTAAATTTCTCTTTGATACACCCTTGACAAT 
TTTTCATGAAATTATTCCTCTTCCTGTTCAATAAATGATTACCCTTGCACTTAA 
174/330
FIGURE 174 

MKMLLLLCLGLTLVCVHAEEASSTGRNFNVEKINGEWHTIILASDKREKIEEHGNFRLFLEQ 
IHVLENSLVLKVHTVRDEECSELSMVADKTEKAGEYSVTYDGFNTFTIPKTDYDNFLMAHLI 
NEKDGETFQLMGLYGREPDLSSDIKERFAQLCEEHGILRENIIDLSNANRCLQARE 
175/330



FIGURE 175 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAA 
TGGATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTG 
TCAGCTTAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTC 
CCAGGAATTATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAG 
AAAAAGAGCGTGCTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCA 
CAGTCATTGGTGCTCTGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTC 
ATGTGTAATTCTCCAAGCAACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGA 
CATTCATCCAGAATCCTTCAACTTGGAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTG 
GTTTCAATAAACCCACCAGTAACGACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCAC 
TTCGATTCTGAAGAAAACAAACATAGGCTTATCCACTTCTCAGTATTTTTAGGTCTATTGCT
TGTTGGAATTCTGGAGGTCCTGTTTGGGCTCAGTCAGATAGTCATCGGTTTCCTTGGCTGTC 
TGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTGTGTAGTTTAATGGGAATAAAATGTAAGTA 
TCAGTAGTTTGAAAAAAAAAAA 
176/330
FIGURE 176 

MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMA
IPATTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNA 
NCEFSLKNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRL 
IHFSVFLGLLLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 
177/330
FIGURE 177 

GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCT 
GTCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 
GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTC 
CAAGTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG 
CACCGATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAATAGTGAA 
AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTCAACGACACC 
CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 
178/330
FIGURE 178

MRLSVCLLMVSLALCCYQAHALVCPAVASEITVFLFLSDAAVNLQVAKLNPPPEALAAKLEV 
KHCTDQISFKKRLSLKKSWWK 
179/330

FIGURE 179 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGG 
TGAAGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTG 
GAGCAGATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTC 
GAACTGTGACATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGG
AAGCCAATGACCCATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAG 
CTGAGCGGACTGATCTGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGG 
CAAATGCAAATACAAGAGCAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCAC 
TCATCACTCCAGGCTCTGCCACTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGA 
AGCCTAACACTGGCCCCCAGCACCTCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTC 
TCTCCAAGGGCAGGCTGTTAGGCCCCTTTCTGATCAGGAGGCTTCTTTATGAATTAAACTCG 
CCCCACCACCCCCTCA 
180/330
FIGURE 180 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCK 
YKSSQKQHSPVPEKAIPLITPGSATTC 
181/330

FIGURE 181 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCC
TGGCTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTC 
CTGGCTACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCC 
AGTGTTTCCCACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCT 
ACAGAGGAGGGCTTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGT 
ATGGCTGGGTCCCATCATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCA 
CCAATGCCTCAGCTGCCATTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGG 
CTGGGAGAAGGGATACTGCTGAGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGAC 
GCCCGCCTTCCATTTCAACATCCTGAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACA 
TCATGCTTGACAAGTGGCAGCACCTGGCCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAG 
CACATCAGCCTCATGACCTTGGACAGTCTACAGAAATGCATCTTCAGCTTTGACAGCCATTG 
TCAGGAGAGGCCCAGTGAATATATTGCCACCATCTTGGAGCTCAGTGCCCTTGTAGAGAAAA 
GAAGCCAGCATATCCTCCAGCACATGGACTTTCTGTATTACCTCTCCCATGACGGGCGGCGC 
TTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGACGCTGTCATCCGGGAGCGGCGTCG 
CACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAAAGCCAAGTCCAAGACTTTGG 
ATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGGCATTGTCAGATGAGGAT 
ATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACGGCCAGTGGCCTCTC 
CTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCGACAGGAGGTGC 
AAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCCAGCTGCCC 
TTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATCTCCCG 
ATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTGCC 
TCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGAC 
CCCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTC 
CGCAGGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGG 
CGTTGATGCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAA 
TTGATCATGCGCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCA 
GTGACTTTCTGACCCATCCACCTGTTTTTTTGCAGATTGTCATGAATATWVCGGTGCTGTCAAA 
182/330
FIGURE 182 

MSLLSLPWLGLRPVAMSPWLLLLLVVGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWG 
HLGLITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLF 
IRFLKPWLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGS 
SRLDMFEHISLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYY 
LSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDG 
KALSDEDIRAEADTFMFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEW 
DDLAQLPFLTMCVKESLRLHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVW 
PDPEVYDPFRFDPENSKGRSPLAFIPFSAGPRNCIGQAFAMAEMKVVLALMLLHFRFLPDHT
EPRRKLELIMRAEGGLWLRVEPLNVGLQ 
183/330
FIGURE 183 

CAACAGAAGCCAAGAAGGAAGCCGTCTATCTTGTGGCGATCATGTATAAGCTGGCCTCCTGC 
TGTTTGCTTTTCACAGGATTCTTAAATCCTCTCTTATCTCTTCCTCTCCTTGACTCCAGGGA 
AATATCCTTTCAACTCTCAGCACCTCATGAAGACGCGCGCTTAACTCCGGAGGAGCTAGAAA 
GAGCTTCCCTTCTACAGATATTGCCAGAGATGCTGGGTGCAGAAAGAGGGGATATTCTCAGG 
AAAGCAGACTCAAGTACCAACATTTTTAACCCAAGAGGAAATTTGAGAAAGTTTCAGGATTT 
CTCTGGACAAGATCCTAACATTTTACTGAGTCATCTTTTGGCCAGAATCTGGAAACCATACA 
AGAAACGTGAGACTCCTGATTGCTTCTGGAAATACTGTGTCTGAAGTGAAATAAGCATCTGT 
TAGTCAGCTCAGAAACACCCATCTTAGAATATGAAAAATAACACAATGCTTGATTTGAAAAC 
AGTGTGGAGAAAAACTAGGCAAACTACACCCTGTTCATTGTTACCTGGAAAATAAATCCTCT 
ATGTTTTGCACAAAAAAAAAAAAAAA 
184/330
FIGURE 184 

MYKLASCCLLFTGFLNPLLSLPLLDSREISFQLSAPHEDARLTPEELERASLLQILPEMLGA 
ERGDILRKADSSTNIFNPRGNLRKFQDFSGQDPNILLSHLLARIWKPYKKRETPDCFWKYCV 
185/330
FIGURE 185 

GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGAT 
GGGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCA 
CCACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGCCATGCCCTCCCC
AGGGACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCT 
CCAGCTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCA 
GCCAAGCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGA 
AGGGGCAGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGT 
CAGGGGTTCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGG 
GAAGAGGCCAAAGAGGCCCCAGCCGACAAGTGATCGCCCACAAGCCTTACTCACCTCTCTCT
AAGTTTAGAAGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTA 
CAAGCTCAGGAGGCGAATAAATGTTCAAACTGTA 
186/330

FIGURE 186 

MPSPGTVCSLLLLGMLWLDLAMAGSSFLSPEHQRVQQRKESKKPPAKLQPRALAGWLRPEDG 
GQAEGAEDELEVRFNAPFDVGIKLSGVQYQQHSQALGKFLQDILWEEAKEAPADK
187/330
FIGURE 187 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTC 
GTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGT 
TCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTG 
CTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCACATCTACCT 
GAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATC 
TTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTT 
GTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAA 
GGGCAAATCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAAC 
TCACGCCGAGACCTGCAGGAGTGGTGCCAGGTGCTTGAAGTAACAAGTTTAAAATGTTCAGA 
GACAATGGAATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTT 
CCAAAAACACAAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCA 
AACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTC 
AGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGT 
AACAACAACCTCCCTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCG 
TCTTCCATTCTCTGGGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGAC 
ACGGGCAGCAGAGTGTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCA 
ACCCCTGGTCAGGGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGA 
GCATCCCCTGCCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGA 
GCCTCCTTGTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACAGAACCCTGGGGAT 
GTTTTAAAACACACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGA 
ATTTTTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
188/330
FIGURE 188 

MNTWLLFLPLFPVQVQTLIVVIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGW 
VVRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSMRTQQ 
AQQEAELTPRPAGVVPGA
189/330
FIGURE 189 

GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACCATGGCCAAG 
ATGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGCATGCT 
ATCACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTGGGCACACAGAAGG 
TGCCCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTG 
GATGGAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGA
CCGGTTCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAG 
AACCAGGGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAA 
GGACTACTGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAA 
GCGGTTGATGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTA 
TGGTTATCCCTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCT 
ACTAACAGACTTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCCTTTGCTG 
CTGTTTCCTCTGTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTC 
TTCCAAGCGACTGTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTG 
GGCCTTCTACATGGCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCA 
ACACGTACACCAGGATGGTGCTGGAGTTCAAGTGCAAGCAT^TAAGAGCTTCAAGGAAAAC 
CCGAACTGCCTACCACATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCAC 
CGTGGGTCCTTTGACCAGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGG 
GAGTCGACTTCTACTCCGAGCTGCGGAACAAGGGATTTCAAAGAGGGGCCAGCCAGGAGCTG 
AAAGAAGCAGTTAGGTCATCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGA 
GTAGGCTTGAGCCCTACCTTACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCG 
TCTCTTGAGCATGGTTTTTAGAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTC 
CTAAGGGATTCCTGGGTGCCACTGCTCTCTTTTCCTCTACAGCTCCATCTTGTTTCACCCAC 
CCCACATCTCACACATCCAGAATTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGC 
TAAACCATGGAGATAAAAAGAAGAGT7UUy\TACACTTCCCGACCTTAAGGATCTGAAA 
190/330
FIGURE 190 

MAKMELSKAFSGQRTLLSAILSMLSLSFSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMP 
VSLDGDTNTSTQEVVQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKR
GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQL 
PPATNRLATHWEPCLWAQTERLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLE 
LWLGLLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
191/330
FIGURE 191 

AACTGGAAGGAAAGAAAGAAAGGTCAGCTTTGGCCCAGATGTGGTTACCCCTTGGTCTCCTG 
TCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAAGTCTCAGGCCTGTCA 
GCAGCTCCTGTGGACATTGCCATCCCCTCTGGTAGCCTTCAGAGCAAACAGGACAACCTATG 
TTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGTGCTTCTGT 
GATCTCTATGACAGAGCCACTTCTCCACCTCTGAAATGTTCCCTGCTCTGAAATCTGGCATG 
AGATGGCACAGGTGACCACGCAGAAGCCACCAGAATCTTGCCTGCCCTATTCCTCCTCCCAA 
GTCTGTTCTCTTATTGTCAACCTCAGCACAACAGGCTGGCGCCAATGGCATTACAGAGAAAG 
CAATCTGTGTGGCTAGTGGGCAGATTACCATGCAAGCCCCAGGAGAAATGGAGGAGCTTTGT 
AGCCACCTCCCTGTCAGCCAGTATTAACATGTCCCCTTCCCCCTGCCCCGCCGTAGATTCAG 
GACATTCGCCCCTGTGTGCCACCAAACCAGGACTTTCCCCTTGGCTTGGCATCCCTGGCTCT 
CTCCTGGTACCCAGCAAGACGTCTGTTCCAGGGCAGTGTAGCATCTTTCAAGCTCCGTTACT 
ATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGAATAATCAAGTGGGAAGGGGAAGCA 
GAGGGAAATGGGGCCATGTGAATGCAGCTGCTCTGTTCTCCCTACCCTGAGGAAAAACCAAA 
GGGAAGCAACAGGAACTTCTGCAACTGGTTTTTATCGGAAAGATCATCCTGCCTGCAGATGC 
TGTTGAAGGGGCACAAGAAATGTAGCTGGAGAAGATTGATGAAAGTGCAGGTGTGTAAGGAA 
ATAGAACAGTCTGCTGGGAGTCAGACCTGGAATTCTGATTCCAAACTCTTTATTACTTTGGG 
AAGTCACTCAGCCTCCCCGTAGCCATCTCCAGGGTGACGGAACCCAGTGTATTACCTGCTGG 
AACCAAGGAAACTAACAATGTAGGTTACTAGTGAATACCGCAATGGTTTCTCCAATTATGCC 
CATGCCACCAAAACAATAAAACAAAATTCTCTAACACTGAAA 
192/330
FIGURE 192 

MWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSGME 
HRNHLCFCDLYDRATSPPLKCSLL 
193/330



FIGURE 193 



GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGCCAGGAGCGACGTCA 
CCGCCATGGCAGGCATCAAAGCTTTGATTAGTTTGTCCTTTGGAGGAGCAATCGGACTGATGTTTTTGATGCTT 
GGATGTGCCCTTCCAATATACAACAAATACTGGCCCCTCTTTGTTCTATTTTTTTACATCCTTTCACCTATTCC 
ATACTGCATAGCAAGAAGATTAGTGGATGATACAGATGCTATGAGTAACGCTTGTAAGGAACTTGCCATCTTTC
TTACAACGGGCATTGTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGATTGAGTGGGGA 
GCTTGTGCACTTGTTCTCACAGGAAACACAGTCATCTTTGCAACTATACTAGGCTTTTTCTTGGTCTTTGGAAG 
CAATGACGACTTCAGCTGGCAGCAGTGGTGAAAAGAAATTACTGAACTATTGTCAAATGGACTTCCTGTCATTT 
GTTGGCCATTCACGCACACAGGAGATGGGGCAGTTAATGCTGAATGGTATAGCAAGCCTCTTGGGGGTATTTTA 
GGTGCTCCCTTCTCACTTTTATTGTAAGCATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGATTTTCT 
CTTTTGGAAAAGCTTGACTGATTTCACACTTATCTATAGTATGCTTTTTGTGGTGTCCTGCTGAATTTAAATAT 
TTATGTGTTTTTCCTGTTAGGTTGATTTTTTTTGGAATCAATATGCAATGTTAAACACTTTTTTAATGTAATCA 
TTTGCATTGGTTAGGAATTCAGAATTCCGCCGGCTCTATTACTGGTCAAGTACATCTTTTCTCTTAAAATTATT 
TAGCCTCCATTATTACAAAAAATTATAAAAATAAGTTTTCAGTCAGTCAGGATGACATCACTCCCAATGTTATG 
CAGACATACAGACGGTTGGCATACGTTATAGACTGTATACTCAGTGCAAATATAGCTGCATTTATACCTCAGAG 
GGGCCAAGTGTTAATGCCCATGCCCTCCGTTAAGGGTTGTTGGTTTTACTGGTAGACAGATGTTTTGTGGATTG 
AAAATTATTTTATGGAATTGCTACAGAGGAGTGCTTTTCTTCTCAATTGTTAGAAGAATTTATGTTAAACTTTA 
AGGTAAGGGTGTAAAAACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGTTAGAGTGAGTTGCAATGT 
GGGAAGAAATGACATTGAAATTCCAGTTTTTGAATCCTGTTTCTATTTATAAGTGAAATTTGTGATCTCCTATC 
AACCTTTCATGTTTTACCCTGTTAAAATGGACATACATGGAACCACTACTGATGAGGGACAGTTGTATGTTTGC 
ATCATATATGCCAGAAAACCTTCCTCTGCTTCCTCCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAA 
TAACTTTTCAAATATAGTTTAATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTACATT 
CAGAGTGCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTTACAGATAATTCATGCA 
TTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTCTAAGGTTATATCATATGTAATTTAAAAG 
TATTTTTAAGACAAGTTTCCTGTATACCTCTGAACTGTTTTGATTTTGAGTTCATCATGATAGATCTGCTGTTT 
CCTTATAAAAGGCATTTGTTGTGTGAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACAT 
ACCTGACCAAAAAATTCCCAGTAACCAGGCATGATCAATTTATAGTGGTCGTTTACATCTAATAATTATCAGGA 
CTTTTTTCAGGAGTGGGTTATAAAAACATTCAAGTTGGTCTGACAGTATTTTGTTAAGGATATTTGTTTGTATG 
TTTATTCAGTATACTTACATAAAAATTATTTCGCCATCAGCCAAAACTCAGTAATCATGACAGCTGTCTGTTGT 
TTTATGAAGTTTATTTCTCAAGAAAATGGGAATAAATTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAA 
AGCCACAGGTTTTATTGCCTAACTTAAGCCATGACTTTTAGATATGAGATGACGGGAAGCAGGACGAAATATCG 
GCGTGTGGCTGGAGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCAGATTTCAAGAGGAA 
GGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAACTCTTTGTGCTTGTGATCTACTGGACTTT 
TTTTTTGCAGGAAGTGCATTCTCTGGTCCTTCCCTATTTTCTGTTCTGGATGTCAGTGCAGTGCACTGCTACTG 
TTTTATCCACTTGGCCACAGACTTTTTCTAACAGCTGCGTATTATTTCTATATACTAATTGCATTGGCAGCATT 
GTGTCTTTGACCTTGTATACTAGCTTGACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACTTGAGATATGAAT 
TTTCCATAGAATATGCACTGATACAACATTACCATTCTTCTATGGAAAGAAAACTTTTGATGATGAAACAATAA 
AGATTTTAAATATCTATTTTAAAAAAAAAA 



WO 00/73454 PCT/US00/08439
194/330
FIGURE 194 

MAGIKALISLSFGGAIGLMFLMLGCALPIYNKYWPLFVLFFYILSPIPYCIARRLVDDTDAM 
SNACKELAIFLTTGIVVSAFGLPIVFARAHLIEWGACALVLTGNTVIFATILGFFLVFGSND
DFSWQQW 
195/330
FIGURE 195 

CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCC 
CACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGAGCCCGGGCGCCTCTCGCCGCTCCC 
CGCGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCGCCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTC 
TCTGCAGCTGCCCGGGGCGCGAATGGCAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTC 
CAATGACGGACATTAACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCAC 
AACGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGCACGTAGTTGG 
GAAACTTGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCAGCCCCGAGCCGAGCAGCAAAGT 
GAGACATTGTGCGCCTGCCAGATCCGCCGGCCGCGGACCGGGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTC 
GCCCTGCATATAATTAGCCTGCACACAAAGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCT 
GACCGAGCGCTTCCAATGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAATGGATTTCCTGCTGCTCGGT
CTCTGTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTTCAGATGCT 
GCCCGCCGCCCCCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCTGTACTGCGAGGCGCTCAACC 
TCACCGAGGCGCCCCACAACCTGTCCGGCCTGCTGGGCTTGTCCCTGCGCTACAACAGCCTCTCGGAGCTGCGC 
GCCGGCCAGTTCACGGGGTTAATGCAGCTCACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGG 
GGACGCCTTTCAGAAACTGCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACA 
CCACCTTCCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCGCCCGAC 
CTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCAGTTTGTGCCCGTGCGCAT 
CTTCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACAATCAGCTCAAGAGTCTGGCGCGCAACTCTT 
TCGCCGGCTTGTTTAAGCTCACCGAGCTGCACCTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTC 
CCGCGCCTCATCTCCCTGCACTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGACTG 
GGTTTGGAACCTGGAGAAAATGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAGACCG 
TGCCGCACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCGGATCCTCAACTCTTGG 
AAGTCCCTGACAAGCATCACCCTGGCCGGGAACCTGTGGGATTGCGGGCGCAACGTGTGTGCCCTAGCCTCGTG 
GCTCAGCAACTTCCAGGGGCGCTACGATGGCAACTTGCAGTGGGCCAGCCCGGAGTACGCACAGGGCGAGGACG 
TCCTGGACGCCGTGTACGCCTTCCACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCC 
GTCACCAACCGCAGTGATCTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAGCA 
CGACGGCACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGCCGTGCAGATCCACA 
AGGTGGTCACGGGCACCATGGCCCTCATCTTCTCCTTCCTCATCGTGGTCCTGGTGCTCTACGTGTCCTGGAAG 
TGTTTCCCAGCCAGCCTCAGGCAGCTCAGACAGTGCTTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGAC 
CATGCATCAGATGGCTGCCATGTCTGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCC 
TGGTGATCATCAACGAGTATGGCTCGTGTACCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTGATTGTCC
CAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGGGCAGCCGGGACGGGCCGGCGGGCACCAGGCTGGGGT 
CTCCTTGTCTGTGCTCTGATATGCTCCTTGACTGAAACTTTAAGGGGATCTCTCCCAGAGACTTGACATTTTAG 
CTTTATTGTGTCTTAAAAACAAAAGCGAATTAAAACACAACAAAAAACCCCACCCCACAACCTTCAGGACAGTC 
TATCTTAAATTTCATATGAGAACTCCTTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAA 
AAAGGTGGCCATAAGACAGAGAGAGAATAATCGTGCTTTGTTTTATGCTACTCCTCCCACCCTGCCCATGATTA 
AACATCATGTATGTAGAAGATCTTAAGTCCATACGCATTTCATGAAGAACCATTGGAAAGAGGAATCTGCAATC 
TGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTATGTTCTTACTTTGTGTGTGTGTCTGTATGTTTCTGCG 
TTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACAAACGGGAATTTAGCTCACATCATTTCATGCCCCTGT 
GCCTCTAGCTCTGGAGATTGGTGGGGGGAGGTGGGGGGAAACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACT 
AAGGTTTTGTAACACTTGAAATCTTTTCTTTCTCAAATTAATTATCTTTAAGCTTCAAGAAACTTGCTCTGACC 
CCTCTAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATTTTTAAAGGTGTAGCACCTTTTTTTTTATTCTTC 
CCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAAGAACTTAAGGCCACAATTCACGTCTCGTCCTG 
GGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCAGCTGATTAAAGTTCAGCAGTGGTATTGAGGTT 
TTTCGAATATTTATATAGAAAAAAAGTCTTTTCACATGACAAATGACACTCTCACACCAGTCTTAGCCCTAGTA 
GTTTTTTAGGTTGGACCAGAGGAAGCAGGTTAAATGAGACCTGTCCTCTGCTGCACTCAGAAAAAATAGGCAGT 
CCCTGATGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACCTACCCACAATGCAGCCTATACTCCCAAG 
ACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTCTCAACCATTTAAAAATAT
TCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCTCTGCCTTCAAGAAGGCAGACATTTGGTATG 
ATTTAGCATCAACAACACATTTATGAGTATATGTAAGTAATCAGAGGGGCAAATGCCACTTGTTATTCCTCCCA 
AGTTTTCCAAGCAAGTACACACAGATCTCTGGTAGGATTAGGGGCCACTTGTGTTTCCGGCTTATTTTAGTCGA 
CTTGTCAGCAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAGGGCATTGATGGATCACATGAGAT 
GGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCAAAGAACCAGAAATTATATCTGTTTTGG 
AGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAAACATAAATTATCTCCTCTAGATGAGTGGCGATGTTG 
GCTGATTTGGGTCTGCCATTGACAGAATGTCAAATAAAAAGGAATTAGCTAGAATATGACCATTAAATGTGCTT 
CTGAAATATATTTTGAGATAGGTTTAGAATGTCA 
196/330
FIGURE 196 

MDFLLLGLCLYWLLRRPSGVVLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEAPH 
NLSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTLSSNQ 
ITQLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRS 
LKFLDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVKVNFAHFPRLISLHSLCLRRNKVAIV 
VSSLDWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSLTSIT 
LAGNLWDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSG 
HLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIHKVVTGTMA 
LIFSFLIWLVLYVSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNH 
IEGALVIINEYGSCTCHQQPARECEV
FIGURE 197 

GTGCAAGGAGCCGAGGCGAGATGGGCGTCCTGGGCCGGGTCCTGCTGTGGCTGCAGCTCTGC
GCACTGACCCAGGCGGTCTCCAAACTCTGGGTCCCCAACACGGACTTCGACGTCGCAGCCAA 
CTGGAGCCAGAACCGGACCCCGTGCGCCGGCGGCGCCGTTGAGTTCCCGGCGGACAAGATGG 
TGTCAGTCCTGGTGCAAGAAGGTCACGCCGTCTCAGACATGCTCCTGCCGCTGGATGGGGAA 
CTCGTCCTGGCTTCAGGAGCCGGATTCGGCGTCTCAGACGTGGGCTCGCACCTGGACTGTGG 
CGCGGGCGAACCTGCCGTCTTCCGCGACTCTGACCGCTTCTCCTGGCATGACCCGCACCTGT 
GGCGCTCTGGGGACGAGGCACCTGGCCTCTTCTTCGTGGACGCCGAGCGCGTGCCCTGCCGC 
CACGACGACGTCTTCTTTCCGCCTAGTGCCTCCTTCCGCGTGGGGCTCGGCCCTGGCGCTAG 
CCCCGTGCGTGTCCGCAGCATCTCGGCTCTGGGCCGGACGTTCACGCGCGACGAGGACCTGG 
CTGTTTTCCTGGCGTCCCGCGCGGGCCGCCTACGCTTCCACGGGCCGGGCGCGCTGAGCGTG 
GGCCCCGAGGACTGCGCGGACCCGTCGGGCTGCGTCTGCGGCAACGCGGAGGCGCAGCCGTG 
GATCTGCGCGGCCCTGCTCCAGCCCCT 
FIGURE 198

MGVLGRVLLWLQLCALTQAVSKLWVPNTDFDVAANWSQNRTPCAGGAVEFPADKMVSVLVQE 
GHAVSDMLLPLDGELVLASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEA 
PGLFFVDAERVPCRHDDVFFPPSASFRVGLGPGASPVRVRSISALGRTFTRDEDLAVFLASR 
AGRLRFHGPGALSVGPEDCADPSGCVCGNAEAQPWICAALLQP 
FIGURE 199 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTT 
TGAGCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTT 
CTGCCTTTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCA 
TTGACGTGGTACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGC 
CGTGTGTCTTGCATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAG 
CTGGTGGACAGTTTTGTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCT 
TGCAGCAATGTGTTGCTTGTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACAT 
TCTCGAACCTGAATGTCAGTAGCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAA 
TCTTCCTCATGTACCTGTTTCCTCTCTGGATGTTGTCCCACTGAATTCCCATGAATACAAAC 
CTATTCAGCAACAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 200 

MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWYSLSFIP 
FARDAVKKCFAVCLA 
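
The correspondence between the nucleotide sequence of Figure 199 and the amino acid sequence of Figure 200 can be checked with a short translation sketch. The Python below is an illustration only: the function name `translate_first_orf` is hypothetical, the standard genetic code is assumed, and the first ATG in the fragment is assumed to be the start codon, which need not hold for every clone.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_first_orf(dna):
    """Translate from the first ATG to the first stop codon (hypothetical helper)."""
    dna = "".join(dna.split()).upper()   # drop line breaks and spaces
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":                            # stop codon ends the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First line of the Figure 199 nucleotide sequence.
FIG199_FRAGMENT = "ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTT"
print(translate_first_orf(FIG199_FRAGMENT))  # MGPVKQLKRM
```

The ten residues printed agree with the first ten residues of Figure 200.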



FIGURE 201 



TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTACCCGCGGCGCGC 
CCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACGGACTGAACGGATCGCTGCGAGGGT 
GGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCAACTACCATCCATAGCCAGATAGATTATCTTACACTG 
AACTGATCAAGTACTTTGAAAATGACTTCGAAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTC 
AACCACCTTTTCTCTCCAACTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACT 
TATATAAAGTTCCAACGCCCCATTTTCATTATATTATGAAATATGGTGTTCACGTGAAGCAAGTTACTAATGTT 
TTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTGCAGAGAATCATGGGATTGTTGC 
AAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCCTTGGATCACATGAATATTTATGATTCCAAGTTTT 
GGGAAGAAGCGACACCAATATGGATCACAAACCAGAGGGCAGGACATACTAGTGGTGCAGCCATGTGGCCCGGA 
ACAGATGTAAAAATACATAAGCGCTTTCCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAG 
AGTTGCCAAAATTGTTGAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGAAGACCCTG 
ATGACATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTTCAGATATTGACAAGAAGTTA 
GGATATCTCATACAAATGCTGAAAAAGGCAAAGTTGTGGAACACTCTGAACCTAATCATCACAAGTGATCATGG 
AATGACGCAGTGCTCTGAGGAAAGGTTAATAGAACTTGACCAGTACCTGGATAAAGACCACTATACCCTGATTG 
ATCAATCTCCAGTAGCAGCCATCTTGCCAAAAGAAGGTAAATTTGATGAAGTCTATGAAGCACTAACTCACGCT 
CATCCTAATCTTACTGTTTACAAAAAAGAAGACGTTCCAGAAAGGTGGCATTACAAATACAACAGTCGAATTCA 
ACCAATCATAGCAGTGGCTGATGAAGGGTGGCACATTTTACAGAATAAGTCAGATGACTTTCTGTTAGGCAACC 
ACGGTTACGATAATGCGTTAGCAGATATGCATCCAATATTTTTAGCCCATGGTCCTGCCTTCAGAAAGAATTTC 
TCAAAAGAAGCCATGAACTCCACAGATTTGTACCCACTACTATGCCACCTCCTCAATATCACTGCCATGCCACA 
CAATGGATCATTCTGGAATGTCCAGGATCTGCTCAATTCAGCAATGCCAAGGGTGGTCCCTTATACACAGAGTA 
CTATACTCCTCCCTGGTAGTGTTAAACCAGCAGAATATGACCAAGAGGGGTCATACCCTTATTTCATAGGGGTC 
TCTCTTGGCAGCATTATAGTGATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTCACAGTCAAATACCTGC 
CTTACAAGATATGCATGCTGAAATAGCTCAACCATTATTACAAGCCTAATGTTACTTTGAAGTGGATTTGCATA 
TTGAAGTGGAGATTCCATAATTATGTCAGTGTTTAAAGGTTTCAAATTCTGGGAAACCAGTTCCAAACATCTGC 
AGAAACCATTAAGCAGTTACATATTTAGGTATACACACACACACACACACACACATACACACACACGGACCAAA 
ATACTTACACCTGCAAAGGAATAAAGATGTGAGAGTATGTCTCCATTGTTCACTGTAGCATAGGGATAGATAAG 
ATCCTGCTTTATTTGGACTTGGCGCAGATAATGTATATATTTAGCAACTTTGCACTATGTAAAGTACCTTATAT 
ATTGCACTTTAAATTTCTCTCCTGATGGGTACTTTAATTTGAAATGCACTTTATGGACAGTTATGTCTTATAAC 
TTGATTGAAAATGACAACTTTTTGCACCCATGTCACAGAATACTTGTTACGCATTGTTCAAACTGAAGGAAATT 
TCTAATAATCCCGAATAATGAACATAGAAATCTATCTCCATAAATTGAGAGAAGAAGAAGGTGATAAGTGTTGA 
AAATTAAATGTGATAACCTTTGAACCTTGAATTTTGGAGATGTATTCCCAACAGCAGAATGCAACTGTGGGCAT 
TTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCATTTATTTTTCCCTCAAAAGAGAGTCAAATACTGACAG 
ATTCGTTCTAAATATATTGTTTCTGTCATAAAATTATTGTGATTTCCTGATGAGTCATATTACTGTGATTTTCA 
TAATAATGAAGACACCATGAATATACTTTTCTTCTATATAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCA 
CCATCTCAGCAATGTTTTCTCTTGTTTGTAATTATTTGCTCCTTTGAAAATTAAATCACTATTAATTACATTAA 
AAATCAAATTGGATAAAAAAAAAAAAAAAAAAA 
FIGURE 202 

MTSKFILVSFILAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVK 
QVTNVFITKTYPNHYTLVTGLFAENHGIVANDMFDPIRNKSFSLDHMNIYDSKFWEEATPIW 
ITNQRAGHTSGAAMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPINLGLLY 
WEDPDDMGHHLGPDSPLMGPVISDIDKKLGYLIQMLKKAKLWNTLNLIITSDHGMTQCSEER 
LIELDQYLDKDHYTLIDQSPVAAILPKEGKFDEVYEALTHAHPNLTVYKKEDVPERWHYKYN 
SRIQPIIAVADEGWHILQNKSDDFLLGNHGYDNALADMHPIFLAHGPAFRKNFSKEAMNSTD 
LYPLLCHLLNITAMPHNGSFWNVQDLLNSAMPRVVPYTQSTILLPGSVKPAEYDQEGSYPYF
IGVSLGSIIVIVFFVIFIKHLIHSQIPALQDMHAEIAQPLLQA 

Signal Peptide: 

amino acids 1-22 

Transmembrane Domain:

amino acids 429-452 

N-glycosylation sites:

amino acids 101-104, 158-161, 292-295, 329-332, 362-365, 369-372, 382-385, 389-392

Somatomedin B Domain: 

amino acids 69-85 

Sulfatase protein Region: 

amino acids 212-241 
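
The residue ranges listed above are 1-based and inclusive, so they map onto zero-based string slices with an offset of one at the start. The sketch below is illustrative only: the helper `residues` is a hypothetical name, only the first 124 residues of Figure 202 are included, and the N-glycosylation check assumes the usual N-{P}-[ST]-{P} consensus pattern rather than whatever tool produced the annotation.

```python
import re

# First two lines (residues 1-124) of the Figure 202 amino acid sequence.
FIG202_N_TERMINUS = (
    "MTSKFILVSFILAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVK"
    "QVTNVFITKTYPNHYTLVTGLFAENHGIVANDMFDPIRNKSFSLDHMNIYDSKFWEEATPIW"
)

# Assumed consensus for N-glycosylation: N, not P, S or T, not P.
N_GLYC = re.compile(r"N[^P][ST][^P]")


def residues(protein, span):
    """Return the residues in a 1-based inclusive range such as '101-104'."""
    first, last = (int(x) for x in span.split("-"))
    return protein[first - 1:last]


signal_peptide = residues(FIG202_N_TERMINUS, "1-22")
glyc_site = residues(FIG202_N_TERMINUS, "101-104")

print(signal_peptide)                      # MTSKFILVSFILAALSLSTTFS
print(glyc_site)                           # NKSF
print(bool(N_GLYC.fullmatch(glyc_site)))   # True
```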
FIGURE 203 

GGATTTTTGTGATCCGCGATTCGCTCCCACGGGCGGGACCTTTGTAACTGCGGGAGGCCCAG 
GACAGGCCCACCCTGCGGGGCGGGAGGCAGCCGGGGTGAGGGAGGTGAAGAAACCAAGACGC 
AGAGAGGCCAAGCCCCTTGCCTTGGGTCACACAGCCAAAGGAGGCAGAGCCAGAACTCACAA 
CCAGATCCAGAGGCAACAGGGACATGGCCACCTGGGACGAAAAGGCAGTCACCCGCAGGGCC
AAGGTGGCTCCCGCTGAGAGGATGAGCAAGTTCTTAAGGCACTTCACGGTCGTGGGAGACGA 
CTACCATGCCTGGAACATCAACTACAAGAAATGGGAGAATGAAGAGGAGGAGGAGGAGGAGG 
AGCAGCGACCACCCACACCAGTCTCAGGCGAGGAAGGCAGAGCTGCAGCCCCTGACGTTGCC 
CCTGCCCCTGGCCCCGCACCCAGGGCCCCCCTTGACTTCAGGGGCATGTTGAGGAAACTGTT 
CAGCTCCCACAGGTTTCAGGTCATCATCATCTGCTTGGTGGTTCTGGATGCCCTCCTGGTGC 
TTGCTGAGCTCATCCTGGACCTGAAGATCATCCAGCCCGACAAGAATAACTATGCTGCCATG 
GTATTCCACTACATGAGCATCACCATCTTGGTCTTTTTTATGATGGAGATCATCTTTAAATT 
ATTTGTCTTCCGCCTGAGTTCTTTCACCACAAGTTTGAGATCCTGGATGCCCGTCGTGGTGG 
TGGTCTCATTCATCCTGGACATTGTCCTCCTGTTCCAGGAGCACCAGTTTGAGGCTCTGGGC 
CTGCTGATTCTGCTCCGGCTGTGGCGGGTGGCCCGGATCATCAATGGGATTATCATCTCAGT 
TAAGACACGTTCAGAACGGCAACTCTTAAGGTTAAAACAGATGAATGTACAATTGGCCGCCA 
AGATTCAACACCTTGAGTTCAGCTGCTCTGAGAAGCCCCTGGACTGATGAGTTTGCTGTATC 
AACCTGTAAGGAGAAGCTCTCTCCGGATGGCTATGGGAATGAAAGAATCCGACTTCTACTCT
CACACAGCCACCGTGAAAGTCCTGGAGTAAAATGTGCTGTGTACAGAAGAGAGAGAAGGAAG 
CAGGCTGGCATGTTCACTGGGCTGGTGTTACGACAGAGAACCTGACAGTCACTGGCCAGTTA 
TCACTTCAGATTACAAATCACACAGAGCATCTGCCTGTTTTCAATCACAAGAGAACAAAACC 
AAAATCTATAAAGATATTCTGTAAATATGACAGAATTTGACAAATAAAAGCATAAACGTGTA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 204 

MATWDEKAVTRRAKVAPAERMSKFLRHFTVVGDDYHAWNINYKKWENEEEEEEEEQPPPTPV
SGEEGRAAAPDVAPAPGPAPRAPLDFRGMLRKLFSSHRFQVIIICLVVLDALLVLAELILDL
KIIQPDKNNYAAMVFHYMSITILVFFMMEIIFKLFVFRLSSFTTSLRSWMPVVVVVSFILDI
VLLFQEHQFEALGLLILLRLWRVARIINGIIISVKTRSERQLLRLKQMNVQLAAKIQHLEFS 
CSEKPLD 

FIGURE 205 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACAT
GCTCTGTCTGTGCCTGTACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTG 
AGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCCATTTTCAAGCTCAGTGTCTTCATCCCCTCC 
CAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCTGGAGATAAGGACCT 
TGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGA
GGCTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATG 
CAGTCCCTGCGGGACTTGGGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAG 
CATGGATAAAAACGGCACGATGACCATCGACTGGAACGAGTGGAGAGACTACCACCTCCTCC 
ACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACGATCTTTGATGTG 
GGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTG 
GAGACACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGG 
ACAGGCTCAAGGTGCTCATGCAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGT 
GGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGGTCACTCTGGCGGGGCAATGGCATCAA 
CGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAGATCAAGCGCC 
TTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCA 
GGGGCCATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCG 
GAAGACAGGCCAGTACTCAGGAATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGG
TGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTGGGCATCATCCCCTATGCCGGCATC 
GACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTGAACAGCGC 
GGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGG 
CCAGCTACCCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCT 
CCGGAGGTGACCATGAGCAGCCTCTTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCT 
GTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCAGCTGTGAGCATCAGCTACGTGG 
TCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGCCGCCCG
GCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATG 
TGCCAACACTAAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGT 
GGGGAGAGCTGGCAGGCCCAGGGCTTGTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCC 
AGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGGCTCCGGGCTCACATGTGTAA 
GGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCTTAGT 
TCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGC 
ATCTCCCTGTGCCCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAG 
CCCACATCCCACCCCCTCGTCCAATCCCATAATCCATGATGAAAGGTGAGGTCACGTGGCCT 
CCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGCTGTGAAGGAAGAGGAAAG 
GATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATGCT 
TGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCA 
TGGTGCTCTGAGCTGGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTG 
TCCCCACTGTGGCATGAGGGCAGTGGAGCACCATGTTTGAGGGCGAAGGGCAGAGCGTTTGT 
GTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGGACTGTTGGGAAAAGGG 
TTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACC 
CCAGCAGGGGCGCAGCGGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTT 
ATTTTGTATTTATTTGAACAGAGTTATGTCCTAACTATTTTTATAGATTTGTTTAATTAATA 
GCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTTGATTGTACCTTCCC 
AAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACAT 
CTGTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTG 
GCTCCTTTCCTTTGGCAGGTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGA 
CTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAATAACCTTGAAGGTGGAATCCAGTTATTTC 
CTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAGTGAGGTGCCTCT 
CACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCC 
CAGCCTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCC 
ATTCCACCAGAATGACCTGATGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATT 
GTTATATATGAACATATAACTGGAGTCGTCAAAAAGCAAATTAAGAAAGAATTGGACGTTAG 
AAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
FIGURE 206 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKD 
LDGQLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILK 
SMDKNGTMTIDWNEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTGMW 
WRHLVAGGGAGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGI 
NVLKIAPESAIKFMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMAL 
RKTGQYSGMLDCARRILAREGVAAFYKGYVPNMLGIIPYAGIDLAVYETLKNAWLQHYAVNS
ADPGVFVLLACGTMSSTCGQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFG 
LYRGLAPNFMKVIPAVSISYVVYENLKITLGVQSR

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation site. 

amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 

FIGURE 207 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCAT 
GGCTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAG 
CAATTGCACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTC 
GCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAA 
ACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCA 
AAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTT 
GCTGATCAAGTGATAGTTGGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGC 
TGGCACCTACAAATGTTATATCATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATA 
AAACTGGAGCCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTG 
CGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCA 
GGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCA 
TGAAGGTTGTGTCTGTGCTCTACAATGTTACGATCAACAACACATACTCCTGTATGATTGAA 
AATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAGAATCGGAGATCAAAAGGCGGAG 
TCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCT 
GGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTTGGCCACAAAAAAG 
CATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCACCACCAGATATGACCTAG 
TTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTGAGCAAACAAGAGCA 
AGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAATCTATCTTCAAA 
GACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAG 
TAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCT 
GGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGCTG 
TAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCA 
CAAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 
GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCT 
TGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAA 
ACAGAGCAGTCGGGGACACCGATTTTATAAAT/^CTGAGCACCTTCTTTTTAAACAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

FIGURE 208 

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDI 
KLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTD 
AGTYKCYIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVD
QGANFSEVSNTSFELNSENVTMKVVSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRR
SHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK
FIGURE 209 

GAATTTGTAGAAGACAGCGGCGTTGCCATGGCGGCGTCTCTGGGGCAGGTGTTGGCTCTGGT 
GCTGGTGGCCGCTCTGTGGGGTGGCACGCAGCCGCTGCTGAAGCGGGCCTCCGCCGGCCTGC 
AGCGGGTTCATGAGCCGACCTGGGCCCAGCAGTTGCTACAGGAGATGAAGACCCTCTTCTTG 
AATACTGAGTACCTGATGCCCTTTCTCCTCAACCAGTGTGGATCCCTTCTCTATTACCTCAC 
CTTGGCATCGACAGATCTGACCCTGGCTGTGCCCATCTGTAACTCTCTGGCTATCATCTTCA 
CACTGATTGTTGGGAAGGCCCTTGGAGAAGATATTGGTGGAAAACGTAAGTTAGACTACTGC 
GAGTGCGGGACGCAGCTCTGTGGATCTCGACATACCTGTGTTAGTTCCTTCCCAGAACCCAT 
CTCCCCAGAGTGGGTGAGGACACGGCCTTTTCCCATCCTGCCCTTTCCTCTGCAGCTGTTTT 
GCTTCCTTGTGGCCATCAGAGTTCCCTTCCCCTGGACAGTCTGGAGAAAGACAGAGGCTGGG 
GTTTGGGATTGAAGACCAGACCCCATCTGAGCCCTTCCTCCAGCCCTGTACCAGCTCCTACT 
GGCATGGCTGAGCTCAGACCCTCCTGATTTCTGCCTATTATCCCAGGAGCAGTTGCTGGCAT 
GGTGCTCACCGTGATAGGAATTTCACTCTGCATCACAAGCTCAGTGAGTAAGACCCAGGGGC 
AACAGTCTACCCTTTGAGTGGGCCGAACCCACTTCCAGCTCTGCTGCCTCCAGGAAGCCCCT 
GGGCCATGAAGTGCTGGCAGTGAGCGGATGGACCTAGCACTTCCCCTCTCTGGCCTTAGCTT 
CCTCCTCTCTTATGGGGATAACAGCTACCTCATGGATCACAATAAGAGAACAAGAGTGAAAG 
AGTTTTGTAACCTTCAAGTGCTGTTCAGCTGCGGGGATTTAGCACAGGAGACTCTACGCTCA 
CCCTCAGCAACCTTTCTGCCCCAGCAGCTCTCTTCCTGCTAACATCTCAGGCTCCCAGCCCA 
GCCACCATTACTGTGGCCTGATCTGGACTATCATGGTGGCAGGTTCCATGGACTGCAGAACT 
CCAGCTGCATGGAAAGGGCCAGCTGCAGACTTTGAGCCAGAAATGCAAACGGGAGGCCTCTG 
GGACTCAGTCAGAGCGCTTTGGCTGAATGAGGGGTGGAACCGAGGGAAGAAGGTGCGTCGGA 
GTGGCAGATGCAGGAAATGAGCTGTCTATTAGCCTTGCCTGCCCCACCCATGAGGTAGGCAG 
AAATCCTCACTGCCAGCCCCTCTTAAACAGGTAGAGAGCTGTGAGCCCCAGCCCCACCTGAC
TCCAGCACACCTGGCGAGTAGTAGCTGTCAATAAATCTATGTAAACAGACAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 210 

MAASLGQVLALVLVAALWGGTQPLLKRASAGLQRVHEPTWAQQLLQEMKTLFLNTEYLMPFL 
LNQCGSLLYYLTLASTDLTLAVPICNSLAIIFTLIVGKALGEDIGGKRKLDYCECGTQLCGS 
RHTCVSSFPEPISPEWVRTRPFPILPFPLQLFCFLVAIRVPFPWTVWRKTEAGVWD 
FIGURE 211 

CTTCTGTAGGACAGTCACCAGGCCAGATCCAGAAGCCTCTCTAGGCTCCAGCTTTCTCTGTG 
GAAGATGACAGCAATTATAGCAGGACCCTGCCAGGCTGTCGAAAAGATTCCGCAATAAAACT 
TTGCCAGTGGGAAGTACCTAGTGAAACGGCCTAAGATGCCACTTCTTCTCATGTCCCAGGCT 
TGAGGCCCTGTGGTCCCCATCCTTGGGAGAAGTCAGCTCCAGCACCATGAAGGGCATCCTCG 
TTGCTGGTATCACTGCAGTGCTTGTTGCAGCTGTAGAATCTCTGAGCTGCGTGCAGTGTAAT 
TCATGGGAAAAATCCTGTGTCAACAGCATTGCCTCTGAATGTCCCTCACATGCCAACACCAG 
CTGTATCAGCTCCTCAGCCAGCTCCTCTCTAGAGACACCAGTCAGATTATACCAGAATATGT 
TCTGCTCAGCGGAGAACTGCAGTGAGGAGACACACATTACAGCCTTCACTGTCCACGTGTCT 
GCTGAAGAACACTTTCATTTTGTAAGCCAGTGCTGCCAAGGAAAGGAATGCAGCAACACCAG 
CGATGCCCTGGACCCTCCCCTGAAGAACGTGTCCAGCAACGCAGAGTGCCCTGCTTGTTATG 
AATCTAATGGAACTTCCTGTCGTGGGAAGCCCTGGAAATGCTATGAAGAAGAACAGTGTGTC 
TTTCTAGTTGCAGAACTTAAGAATGACATTGAGTCTAAGAGTCTCGTGCTGAAAGGCTGTTC 
CAACGTCAGTAACGCCACCTGTCAGTTCCTGTCTGGTGAAAACAAGACTCTTGGAGGAGTCA 
TCTTTCGAAAGTTTGAGTGTGCAAATGTAAACAGCTTAACCCCCACGTCTGCACCAACCACT 
TCCCACAACGTGGGCTCCAAAGCTTCCCTCTACCTCTTGGCCCTTGCCAGCCTCCTTCTTCG 
GGGACTGCTGCCCTGAGGTCCTGGGGCTGCACTTTGCCCAGCACCCCATTTCTGCTTCTCTG 
AGGTCCAGAGCACCCCCTGCGGTGCTGACACCCTCTTTCCCTGCTCTGCCCCGTTTAACTGC 
CCAGTAAGTGGGAGTCACAGGTCTCCAGGCAATGCCGACAGCTGCCTTGTTCTTCATTATTA 
AAGCACTGGTTCATTCACTGCCAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 212 

MKGILVAGITAVLVAAVESLSCVQCNSWEKSCVNSIASECPSHANTSCISSSASSSLETPVR 
LYQNMFCSAENCSEETHITAFTVHVSAEEHFHFVSQCCQGKECSNTSDALDPPLKNVSSNAE 
CPACYESNGTSCRGKPWKCYEEEQCVFLVAELKNDIESKSLVLKGCSNVSNATCQFLSGENK 
TLGGVIFRKFECANVNSLTPTSAPTTSHNVGSKASLYLLALASLLLRGLLP 
FIGURE 213 

GGCCTCGGTTCAAACGACCCGGTGGGTCTACAGCGGAAGGGAGGGAGCGAAGGTAGGAGGCA 
GGGCTTGCCTCACTGGCCACCCTCCCAACCCCAAGAGCCCAGCCCCATGGTCCCCGCCGCCG
GCGCGCTGCTGTGGGTCCTGCTGCTGAATCTGGGTCCCCGGGCGGCGGGGGCCCAAGGCCTG 
ACCCAGACTCCGACCGAAATGCAGCGGGTCAGTTTACGCTTTGGGGGCCCCATGACCCGCAG 
CTACCGGAGCACCGCCCGGACTGGTCTTCCCCGGAAGACAAGGATAATCCTAGAGGACGAGA 
ATGATGCCATGGCCGACGCCGACCGCCTGGCTGGACCAGCGGCTGCCGAGCTCTTGGCCGCC 
ACGGTGTCCACCGGCTTTAGCCGGTCGTCCGCCATTAACGAGGAGGATGGGTCTTCAGAAGA 
GGGGGTTGTGATTAATGCCGGAAAGGATAGCACCAGCAGAGAGCTTCCCAGTGCGACTCCCA 
ATACAGCGGGGAGTTCCAGCACGAGGTTTATAGCCAATAGTCAGGAGCCTGAAATCAGGCTG 
ACTTCAAGCCTGCCGCGCTCCCCCGGGAGGTCTACTGAGGACCTGCCAGGCTCGCAGGCCAC 
CCTGAGCCAGTGGTCCACACCTGGGTCTACCCCGAGCCGGTGGCCGTCACCCTCACCCACAG 
CCATGCCATCTCCTGAGGATCTGCGGCTGGTGCTGATGCCCTGGGGCCCGTGGCACTGCCAC 
TGCAAGTCGGGCACCATGAGCCGGAGCCGGTCTGGGAAGCTGCACGGCCTTTCCGGGCGCCT 
TCGAGTTGGGGCGCTGAGCCAGCTCCGCACGGAGCACAAGCCTTGCACCTATCAACAATGTC 
CCTGCAACCGACTTCGGGAAGAGTGCCCCCTGGACACAAGTCTCTGTACTGACACCAACTGT 
GCCTCTCAGAGCACCACCAGTACCAGGACCACCACTACCCCCTTCCCCACCATCCACCTCAG 
AAGCAGTCCCAGCCTGCCACCCGCCAGCCCCTGCCCAGCCCTGGCTTTTTGGAAACGGGTCA 
GGATTGGCCTGGAGGATATTTGGAATAGCCTCTCTTCAGTGTTCACAGAGATGCAACCAATA 
GACAGAAACCAGAGGTAATGGCCACTTCATCCACATGAGGAGATGTCAGTATCTCAACCTCT 
CTTGCCCTTTCAATCCTAGCACCCACTAGATATTTTTAGTACAGAAAAACAAAACTGGAAAA 
CACAA 
FIGURE 214 

MVPAAGALLWVLLLNLGPRAAGAQGLTQTPTEMQRVSLRFGGPMTRSYRSTARTGLPRKTRI 
ILEDENDAMADADRLAGPAAAELLAATVSTGFSRSSAINEEDGSSEEGVVINAGKDSTSREL
PSATPNTAGSSSTRFIANSQEPEIRLTSSLPRSPGRSTEDLPGSQATLSQWSTPGSTPSRWP 
SPSPTAMPSPEDLRLVLMPWGPWHCHCKSGTMSRSRSGKLHGLSGRLRVGALSQLRTEHKPC 
TYQQCPCNRLREECPLDTSLCTDTNCASQSTTSTRTTTTPFPTIHLRSSPSLPPASPCPALA 
FWKRVRIGLEDIWNSLSSVFTEMQPIDRNQR 



FIGURE 215 



CCCGGGTCGACCCACGCGTCCGGGGAGAAAGGATGGCCGGCCTGGCGGCGCGGTTGGTCCTGCTAGCTGGGGCA 
GCGGCGCTGGCGAGCGGCTCCCAGGGCGACCGTGAGCCGGTGTACCGCGACTGCGTACTGCAGTGCGAAGAGCA 
GAACTGCTCTGGGGGCGCTCTGAATCACTTCCGCTCCCGCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCT 
GTCGGGACGACTGTAAGTATGAGTGTATGTGGGTCACCGTTGGGCTCTACCTCCAGGAAGGTCACAAGGTGCCT
CAGTTCCATGGCAAGTGGCCCTTCTCCCGGTTCCTGTTCTTTCAAGAGCCGGCATCGGCCGTGGCCTCGTTTCT 
CAATGGCCTGGCCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGTGCCAGCCTCCTCCCCCATGTACCACA
CCTGTGTGGCCTTCGCCTGGGTGTCCCTCAATGCATGGTTCTGGTCCACAGTCTTCCACACCAGGGACACTGAC 
CTCACAGAGAAAATGGACTACTTCTGTGCCTCCACTGTCATCCTACACTCAATCTACCTGTGCTGCGTCAGGAC 
CGTGGGGCTGCAGCACCCAGCTGTGGTCAGTGCCTTCCGGGCTCTCCTGCTGCTCATGCTGACCGTGCACGTCT 
CCTACCTGAGCCTCATCCGCTTCGACTATGGCTACAACCTGGTGGCCAACGTGGCTATTGGCCTGGTCAACGTG 
GTGTGGTGGCTGGCCTGGTGCCTGTGGAACCAGCGGCGGCTGCCTCACGTGCGCAAGTGCGTGGTGGTGGTCTT 
GCTGCTGCAGGGGCTGTCCCTGCTCGAGCTGCTTGACTTCCCACCGCTCTTCTGGGTCCTGGATGCCCATGCCA 
TCTGGCACATCAGCACCATCCCTGTCCACGTCCTCTTTTTCAGCTTTCTGGAAGATGACAGCCTGTACCTGCTG 
AAGGAATCAGAGGACAAGTTCAAGCTGGACTGAAGACCTTGGAGCGAGTCTGCCCCAGTGGGGATCCTGCCCCC 
GCCCTGCTGGCCTCCCTTCTCCCCTCAACCCTTGAGATGATTTTCTCTTTTCAACTTCTTGAACTTGGACATGA 
AGGATGTGGGCCCAGAATCATGTGGCCAGCCCACCCCCTGTTGGCCCTCACCAGCCTTGGAGTCTGTTCTAGGG 
AAGGCCTCCCAGCATCTGGGACTCGAGAGTGGGCAGCCCCTCTACCTCCTGGAGCTGAACTGGGGTGGAACTGA 
GTGTGTTCTTAGCTCTACCGGGAGGACAGCTGCCTGTTTCCTCCCCACCAGCCTCCTCCCCACATCCCCAGCTG 
CCTGGCTGGGTCCTGAAGCCCTCTGTCTACCTGGGAGACCAGGGACCACAGGCCTTAGGGATACAGGGGGTCCC 
CTTCTGTTACCACCCCCCACCCTCCTCCAGGACACCACTAGGTGGTGCTGGATGCTTGTTCTTTGGCCAGCCAA 
GGTTCACGGCGATTCTCCCCATGGGATCTTGAGGGACCAAGCTGCTGGGATTGGGAAGGAGTTTCACCCTGACC 
GTTGCCCTAGCCAGGTTCCCAGGAGGCCTCACCATACTCCCTTTCAGGGCCAGGGCTCCAGCAAGCCCAGGGCA 
AGGATCCTGTGCTGCTGTCTGGTTGAGAGCCTGCCACCGTGTGTCGGGAGTGTGGGCCAGGCTGAGTGCATAGG 
TGACAGGGCCGTGAGCATGGGCCTGGGTGTGTGTGAGCTCAGGCCTAGGTGCGCAGTGTGGAGACGGGTGTTGT 
CGGGGAAGAGGTGTGGCTTCAAAGTGTGTGTGTGCAGGGGGTGGGTGTGTTAGCGTGGGTTAGGGGAACGTGTG 
TGCGCGTGCTGGTGGGCATGTGAGATGAGTGACTGCCGGTGAATGTGTCCACAGTTGAGAGGTTGGAGCAGGAT 
GAGGGAATCCTGTCACCATCAATAATCACTTGTGGAGCGCCAGCTCTGCCCAAGACGCCACCTGGGCGGACAGC 
CAGGAGCTCTCCATGGCCAGGCTGCCTGTGTGCATGTTCCCTGTCTGGTGCCCCTTTGCCCGCCTCCTGCAAAC 
CTCACAGGGTCCCCACACAACAGTGCCCTCCAGAAGCAGCCCCTCGGAGGCAGAGGAAGGAAAATGGGGATGGC 
TGGGGCTCTCTCCATCCTCCTTTTCTCCTTGCCTTCGCATGGCTGGCCTTCCCCTCCAAAACCTCCATTCCCCT 
GCTGCCAGCCCCTTTGCCATAGCCTGATTTTGGGGAGGAGGAAGGGGCGATTTGAGGGAGAAGGGGAGAAAGCT 
TATGGCTGGGTCTGGTTTCTTCCCTTCCCAGAGGGTCTTACTGTTCCAGGGTGGCCCCAGGGCAGGCAGGGGCC 
ACACTATGCCTGTGCCCTGGTAAAGGTGACCCCTGCCATTTACCAGCAGCCCTGGCATGTTCCTGCCCCACAGG 
AATAGAATGGAGGGAGCTCCAGAAACTTTCCATCCCAAAGGCAGTCTCCGTGGTTGAAGCAGACTGGATTTTTG 
CTCTGCCCCTGACCCCTTGTCCCTCTTTGAGGGAGGGGAGCTATGCTAGGACTCCAACCTCAGGGACTCGGGTG 
GCCTGCGCTAGCTTCTTTTGATACTGAAAACTTTTAAGGTGGGAGGGTGGCAAGGGATGTGCTTAATAAATCAA 
TTCCAAGCCTCAAAAAAAAAAAAAAAAA 
FIGURE 216

MAGLAARLVLLAGAAALASGSQGDREPVYRDCVLQCEEQNCSGGALNHFRSRQPIYMSLAGW 
TCRDDCKYECMWVTVGLYLQEGHKVPQFHGKWPFSRFLFFQEPASAVASFLNGLASLVMLCR 
YRTFVPASSPMYHTCVAFAWVSLNAWFWSTVFHTRDTDLTEKMDYFCASTVILHSIYLCCVR 
TVGLQHPAVVSAFRALLLLMLTVHVSYLSLIRFDYGYNLVANVAIGLVNVVWWLAWCLWNQR
RLPHVRKCVVVVLLLQGLSLLELLDFPPLFWVLDAHAIWHISTIPVHVLFFSFLEDDSLYLL
KESEDKFKLD 

Important features:
Signal peptide:

amino acids 1-20 

Transmembrane domains: 

amino acids 105-123, 138-156, 169-185, 193-209, 221-240, 256-272 



N-glycosylation site.
amino acids 40-44 

N-myristoylation site.

amino acids 43-49



CUB domain proteins profile. 

amino acids 285-302 

Amiloride-sensitive sodium channels proteins . 

amino acids 162-186 
FIGURE 217 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGTGA 
CTATGGCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTG 
CTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGAC 
CAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGC
TGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAGCCAGGG 
CAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAA 
ACTCCAATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGATATCAACACCA 
ACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAGGGGGCAGAGATG 
GAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTCTTCCGCCCCATTGA 
GGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGG 
TACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTC 
TTTGATCTTGAATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGG 
TCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGT 
TTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGA 
GCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGT 
CCTGTTTGCACTGTGCTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGC 
TCGGGGGGCTGCAGGTCCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTG 
CGCGTGGTCACACTGCTCTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGA 
GCTGACCCAGGAGATGTCCCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAG 
GCCTGTGGGAACAGGGCTGGTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGAT 
GCCCGTGAGAAGGTGCTGCAGACACTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCG 
TCAGGACCCCCAGCTCGGCAGGACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCA 
GCCTGGAGCTGCAGGATGGTGAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAAC 
AGCTTGCTGAAGGAGCTGAGATGAGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGA 
GGCTGAGGGGTGCCAGCGTGGGTGGGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCT 
TGGCCATTAAATGGAAACCTGAAGGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 218 

MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEEEL 
DAEVLEVFHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKLQYEDKFRNNLKGKRLDINTN 
TYTSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNVVIETDMQIMV
RLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQVVINGLNSTEPLVKEYAAF
VLGAAFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKL 
GGLQVLRTLVQEKGTEVLAVRVVTLLYDLVTEKMFAEEEAELTQEMSPEKLQQYRQVHLLPG
LWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTLASLQAEYQVLAS 
LELQDGEDEGYFQELLGSVNSLLKELR 

Important features:
Signal peptide: 

amino acids 1-29 

Hypothetical YJL126w/YLR351c/yhcX family protein.

amino acids 364-373 

N-glycosylation site. 

amino acids 193-197, 236-240 

N-myristoylation site. 

amino acids 15-21, 19-25, 234-240, 251-257, 402-408, 451-457 



Homologous region SLS1 protein.

amino acids 68-340 
FIGURE 219 

TTCGGCTTCCGTAGAGGAAGTGGCGCGGACCTTCATTTGGGGTTTCGGTTCCCCCCCTTCCC 
CTTCCCCGGGGTCTGGGGGTGACATTGCACCGCGCCCCTCGTGGGGTCGCGTTGCCACCCCA 
CGCGGACTCCCCAGCTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCC 
TTCCCACCTGACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGC
CCGGCCTTCGCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGT 
CGCAGGGGCATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGG 
TCCATGTGACCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCT 
GTCTCTGTCCTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGA 
TGAAGGGTTAGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCT 
ATGTTTCTGGTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCT 
GATGCACTTGGGCCAGGTGTGGTTGGGATCCATGGAGACTCACCCTATTACTTCCTGACTTC 
AGCCTTTCTGACAGCAGCCATTATCCTGCTCCATACCTTTTGGGGAGTTGTGTTCTTTGATG 
CCTGTGAGAGGAGACGGTACTGGGCTTTGGGCCTGGTGGTTGGGAGTCACCTACTGACATCG 
GGACTGACATTCCTGAACCCCTGGTATGAGGCCAGCCTGCTGCCCATCTATGCAGTCACTGT 
TTCCATGGGGCTCTGGGCCTTCATCACAGCTGGAGGGTCCCTCCGAAGTATTCAGCGCAGCC 
TCTTGTGTAAGGACTGACTACCTGGACTGATCGCCTGACAGATCCCACCTGCCTGTCCACTG 
CCCATGACTGAGCCCAGCCCCAGCCCGGGTCCATTGCCCACATTCTCTGTCTCCTTCTCGTC 
GGTCTACCCCACTACCTCCAGGGTTTTGCTTTGTCCTTTTGTGACCGTTAGTCTCTAAGCTT 
TACCAGGAGCAGCCTGGGTTCAGCCAGTCAGTGACTGGTGGGTTTGAATCTGCACTTATCCC 
CACCACCTGGGGACCCCCTTGTTGTGTCCAGGACTCCCCCTGTGTCAGTGCTCTGCTCTCAC 
CCTGCCCAAGACTCACCTCCCTTCCCCTCTGCAGGCCGACGGCAGGAGGACAGTCGGGTGAT 
GGTGTATTCTGCCCTGCGCATCCCACCCGAGGACTGAGGGAACCTAGGGGGGACCCCTGGGC 
CTGGGGTGCCCTCCTGATGTCCTCGCCCTGTATTTCTCCATCTCCAGTTCTGGACAGTGCAG 
GTTGCCAAGAAAAGGGACCTAGTTTAGCCATTGCCCTGGAGATGAAATTAATGGAGGCTCAA 
GGATAGATGAGCTCTGAGTTTCTCAGTACTCCCTCAAGACTGGACATCTTGGTCTTTTTCTC 
AGGCCTGAGGGGGAACCATTTTTGGTGTGATAAATACCCTAAACTGCCTTTTTTTCTTTTTT 
GAGGTGGGGGGAGGGAGGAGGTATATTGGAACTCTTCTAACCTCCTTGGGCTATATTTTCTC 
TCCTCGAGTTGCTCCTCATGGCTGGGCTCATTTCGGTCCCTTTCTCCTTGGTCCCAGACCTT 
GGGGGAAAGGAAGGAAGTGCATGTTTGGGAACTGGCATTACTGGAACTAATGGTTTTAACCT 
CCTTAACCACCAGCATCCCTCCTCTCCCCAAGGTGAAGTGGAGGGTGCTGTGGTGAGCTGGC 
CACTCCAGAGCTGCAGTGCCACTGGAGGAGTCAGACTACCATGACATCGTAGGGAAGGAGGG 
GAGATTTTTTTGTAGTTTTTAATTGGGGTGTGGGAGGGGCGGGGAGGTTTTCTATAAACTGT 
ATCATTTTCTGCTGAGGGTGGAGTGTCCCATCCTTTTAATCAAGGTGATTGTGATTTTGACT 
AATAAAAAAGAATTTGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAA
FIGURE 220 

MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVTDR 
SDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYVSGLS 
FGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFDACERRR
YWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQRSLLCKD
FIGURE 221 

AAGCTGGTTTAAGGAAGCAGAGGAGGGTTAGATTCGTTGAGTGAGGACGGAAGATCAACCCA 
TTTCCATTCCGCCAGATGGCCTATGTTTCTGGTCTCTCCCTTCGGNATCATCAGTGGTGTNT 
TNTCTGTTATCAATATTTTGGCTGATGCANTTGGGCCAGGTGTGGTTGGGATCCATGGAGAC 
TCACCCTATTANTTCCTGANTTCAGCCTTTNTGACAGCAGCCATTATCCTGCTC 

FIGURE 222 

GACCGACCGTTCAGATGCCCGGTTCCAGTACGGCTTCCTGATTTTTGGTGCTGCTGTNTCTG 
TCCTTCTACAGGAGGTGTTCCGCTTTGCCTANTACAAGCTGCTTAAGAAGGCAGATGAGGGG 
TTAGCATNGCTGAGTGAGGACGGAAGATCACCCATTTCCATCCGCCAGATGGCCTATGTTTN 
TGGTNTTTCCTTCGGTATCATCAGTGGTGTTTTNTCTGTTATCAATATTTTGGNTGATGCAN 
TTGGGCCAGGTGTGGTTGGGATCCATGGAGANTCACCCTATTAATTCCTGAATTCAGCCTTT 
NTGACAGCAGCCATTATCCTGNTCCATACCTTTTGGGGAGTTGTGTTTTTTGATGCCTGTGA 
GAGGAG 
FIGURE 223 

NGTTGGAGAAGTGGCGCGGACNTTCATTTGGGGTTTCGGTTTCCCCCCTTTCCCTTTCCCCG 
GGGTCTGGGGTGACATTGCACGGGCCCCTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCC 
CCAGNTGGNGCGCCCTTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACNTG 
ACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGCCCGGCCTTCG 
CGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGCA 
TTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGAC 
CGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTCC 
TTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTTA 
GCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTGG 
TCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTTG 
GGCCAGGTGTGGTTGGGATCCATGGAGACTCACCC 

FIGURE 224 

GTAAAAGAAAGTGGCCGGACCTTCATTGGGGTTTCGGTTCCCCCCTTTCCCNTTCCCCGGGG 
TCTGGGGGTGACATTGCACCGCGCCCNTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCCC 
CAGNTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACCTGA 
CCAGCCATGGGGGCTGCGGTGTTTTTCGGGCTGCACTTTCGTCGCGTTCGGGCCCGGCCTTC 
GCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGC 
ATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGA 
CCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTC 
CTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTT 
AGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTG 
GTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTT 
GGGCCAGGTGTGGTTGGGATCCATGGAGAC 
FIGURE 225 

GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAG 
TGGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCC
TACCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGG 
TGTCTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCT 
AGACTCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGT 
CCTTGTGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATGGCTACAGCAAGACCCC
CCTGGATGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTT 
CTCGCCAACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCA 
GGACCTGGGAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCA 
ATGGATCCGACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAAC 
CAGCTCTACTGCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAG 
GAAGAAAGTTTTCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGC 
AGCAGATGTTCCAGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCT 
AACGACCTCATGCTCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCAT 
CAACGTCTCCTCTCATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAA 
CCAAGAGCCCCCAAGTGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGT 
CAGAAAAGGTGCGAGGATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGA 
CAAAGCAGGTAGAGACTCCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCC 
TGCAGGGACTCGTGTCCTGGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTAC 
ACGAACCTCTGCAAGTTCACCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCAT 
CCCAGGACTCAGCACACCGGCATCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAG 
ACCCTCATTCCTTCCCAGAGATGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCT 
CCTGGACTCAGGGTCTGCTTCCCCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGG 
GAACAATTTCCAAAACTGTCCAGGGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCAT 
CCTCAAGCTCAGGGCCCATCCCTTCTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAAA 
CTGAGAAGTGGAAAAAAAAA 
226/330
FIGURE 226 

MATARPPWMWVLCALITALLLGVTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDD 
SSSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLS 
PVYESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCL 
VSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGP 
VVCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS
227/330
FIGURE 227 

ATGGTCAACGACCGGTGGAAGACCATGGGCGGCGCTGCCCAACTTGAGGACCGGCCGCGCGA
CAAGCCGCAGCGGCCGAGCTGCGGCTACGTGCTGTGCACCGTGCTGCTGGCCCTGGCTGTGC 
TGCTGGCTGTAGCTGTCACCGGTGCCGTGCTCTTCCTGAACCACGCCCACGCGCCGGGCACG 
GCGCCCCCACCTGTCGTCAGCACTGGGGCTGCCAGCGCCAACAGCGCCCTGGTCACTGTGGA 
AAGGGCGGACAGCTCGCACCTCAGCATCCTCATTGACCCGCGCTGCCCCGACCTCACCGACA 
GCTTCGCACGCCTGGAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGACAGAGCACCAGGCC 
CAGCCACGGCTGGTGGGCGACCAGGAGCAGGAGCTGCTGGACACGCTGGCCGACCAGCTGCC 
CCGGCTGCTGGCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGGGGCTGCGGAAGGGGCATG 
GCACGCTGGGCCAGGGCCTCAGCGCCCTGCAGAGTGAGCAGGGCCGCCTCATCCAGCTTCTC 
TCTGAGAGCCAGGGCCACATGGCTCACCTGGTGAACTCCGTCAGCGACATCCTGGATGCCCT
GCAGAGGGACCGGGGGCTGGGCCGGCCCCGCAACAAGGCCGACCTTCAGAGAGCGCCTGCCC 
GGGGAACCCGGCCCCGGGGCTGTGCCACTGGCTCCCGGCCCCGAGACTGTCTGGACGTCCTC 
CTAAGCGGACAGCAGGACGATGGCGTCTACTCTGTCTTTCCCACCCACTACCCGGCCGGCTT 
CCAGGTGTACTGTGACATGCGCACGGACGGCGGCGGCTGGACGGTGTTTCAGCGCCGGGAGG 
ACGGCTCCGTGAACTTCTTCCGGGGCTGGGACGCGTACCGAGACGGCTTTGGCAGGCTCACC 
GGGGAGCACTGGCTAGGGCTCAAGAGGATCCACGCCCTGACCACACAGGCTGCCTACGAGCT 
GCACGTGGACCTGGAGGACTTTGAGAATGGCACGGCCTATGCCCGCTACGGGAGCTTCGGCG 
TGGGCTTGTTCTCCGTGGACCCTGAGGAAGACGGGTACCCGCTCACCGTGGCTGACTATTCC 
GGCACTGCAGGCGACTCCCTCCTGAAGCACAGCGGCATGAGGTTCACCACCAAGGACCGTGA 
CAGCGACCATTCAGAGAACAACTGTGCCGCCTTCTACCGCGGTGCCTGGTGGTACCGCAACT 
GCCACACGTCCAACCTCAATGGGCAGTACCTGCGCGGTGCGCACGCCTCCTATGCCGACGGC 
GTGGAGTGGTCCTCCTGGACCGGCTGGCAGTACTCACTCAAGTTCTCTGAGATGAAGATCCG 
GCCGGTCCGGGAGGACCGCTAGACTGGTGCACCTTGTCCTTGGCCCTGCTGGTCCCTGTCGC 
CCCATCCCCGACCCCACCTCACTCTTTCGTGAATGTTCTCCACCCACCTGTGCCTGGCGGAC 
CCACTCTCCAGTAGGGAGGGGCCGGGCCATCCCTGACACGAAGCTCCCTGGGCCGGTGAAGT 
CACACATCGCCTTCTCGCCGTCCCCACCCCCTCCATTTGGCAGCTCACTGATCTCTTGCCTC 
TGCTGATGGGGGCTGGCAAACTTGACGACCCCAACTCCTGCCTGCCCCCACTGTGACTCCGG 
TGCTGTTTGCCGTCCCCTGGCCAGGATGGTGGAGTCTGCCCCAGGCACCCTCTGCCCTGCCC 
GGCCAAATACCCGGCATTATGGGGACAGAGAGCAGGGGGCAGACAGCACCCCTGGAGTCCTC 
CTAGCAGATCGTGGGGAATGTCAGGTCTCTCTGAGGTCAGGTCTGAGGCCAGTATCCTCCAG 
CCCTCCCAATGCCAACCCCCACCCCGTTTCCCTGGTGCCCAGAGAACCCACCTCTCCCCCAA 
GGGCCTCAGCCTGGCTGTGGGCTGGGTGGCCCCATCCTACCAGGCCCTGAGGTCAGGATGGG 
GAGCTGCTGCCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCTGGAGGCCACCCAC 
CCTGTGCCCCGGCAGGCCTGGGGTCTGCAGTCCTCTTACCTGCTGTGCCCACCTGCTCTCTG 
TCTCAAATGAGGCCCAACCCATCCCCCACCCAGCTCCCGGCCGTCCTCCTACCTGGGGCAGC 
CGGGGCTGCCATCCCATTTCTCCTGCCTCTGGAAGGTGGGTGGGGCCCTGCACCGTGGGGCT 
GGACTGCGCTAATGGGAAGCTCTTGGTTTTCTGGGCTGGGGCCTAGGCAGGGCTGGGATGAG 
GCTTGTACAACCCCCACCACCAATTTCCCAGGGACTCCAGGGTCCTGAGGCCTCCCAGGAGG 
GCCTTGGGGGTGATGACCCCTTCCCTGAGGTGGCTGTCTCCATGAGGAGGCCAACCCTTGCC 
ATTGACCGTGGCCACCTGGACCCAGGCCAGGCCCGGCCCGGCGAGTGGTCAAGGGACAGGGA 
CCACCTCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCAGACCAGGCACCACCTGGACA 
CTTTCTTGTTGAATCCTCCCAACACCCAGCACGCTGTCATCCCCACTCCTTGTGTGCACACA 
TGCAGAGGTGAGACCCGCAGGCTCCCAGGACCAGCAGCCACAAGGGCAGGGCTGGAGCCGGG 
TCCTCAGCTGTCTGCTCAGCAGCCCTGGACCCGCGTGCGTTACGTCAGGCCCAGATGCAGGG 
CGGCTTTTCCAAGGCCTCCTGATGGGGGCCTCCGAAAGGGCTGGAGTCAGCCTTGGGGAGCT 
GCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAGGTGGCTTCCTCCAAAGGACACCCGATGGCA 
GGTGCCTAGGGGGTGTGGGGTTCCGTTCTCCCTTCCCCTCCCACTGAAGTTTGTGCTTAAAA 
AACAATAAATTTGACTTGGCACCACTGGGGGTTGGTGGGAGAGGCCGTGTGACCTGGCTCTC 
TGTCCCAGTGCCACCAGGTCATCCACATGCGCAG 
228/330
FIGURE 228 

MVNDRWKTMGGAAQLEDRPRDKPQRPSCGYVLCTVLLALAVLLAVAVTGAVLFLNHAHAPGT 
APPPVVSTGAASANSALVTVERADSSHLSILIDPRCPDLTDSFARLESAQASVLQALTEHQA 
QPRLVGDQEQELLDTLADQLPRLLARASELQTECMGLRKGHGTLGQGLSALQSEQGRLIQLL 
SESQGHMAHLVNSVSDILDALQRDRGLGRPRNKADLQRAPARGTRPRGCATGSRPRDCLDVL 
LSGQQDDGVYSVFPTHYPAGFQVYCDMRTDGGGWTVFQRREDGSVNFFRGWDAYRDGFGRLT 
GEHWLGLKRIHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPEEDGYPLTVADYS 
GTAGDSLLKHSGMRFTTKDRDSDHSENNCAAFYRGAWWYRNCHTSNLNGQYLRGAHASYADG 
VEWSSWTGWQYSLKFSEMKIRPVREDR 
229/330

FIGURE 229 

GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCT 
TGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCT
CACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGCTGGATGATG 
ATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACTCGGCATCCAGAGCCC 
CGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGACCCTGCTGAC 
TTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACC 
AGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATACGTCC 
CAAGAGTTGCAATCTCTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGC 
TGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGCACACAGGTGCAGCCCTTGTACAG 
AACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATAAAGACAGCAAAAGTTGGGAG
GACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAACAAGAAGA 
CCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTT 
TGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTG 
TTCCATATTATAATAGATGTCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGG 
GATGATCTTCTCAAAGGACTGCAAAGAATTGAAGCGTTGTGTCTGTGAGAGAAGGGCAGGAA 
TGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGCGAAGGTGACTGATTCGCC 
CTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAGACAT 
TGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTG 
TTTCCTGTTCAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACA 
AGAAGTCTTATTTACATGCCACCAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTG 
GCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTTTTCAT 
GTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAATACATTGAGGTAAC 
ATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTACC 
AGCAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCTTCCCAATCCAT 
CAGTAAAGACCCCATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAG 
AATCTCAAATCTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTG 
TCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATATATCCCCATCTCCGTTTCATATCAG 
AACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGTGAGCCTCTTCA 
TCTGCACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAG 
GTACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
230/330 
FIGURE 230 

MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLL 
IGLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCRE 
LYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAAS 
QSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFSKD 
CKELKRCVCERRAGMVKPESLHVPPETLGEGD 
231/330
FIGURE 231 

AATTTTCACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGNTGG 
ATGATGATGGGACACCACCATGAGCCTGCATTNTCAAGCTTTTGCCACAATTCGGCATCCAG 
AGCCCCGGCGCACAGAGCACAGGGNTCCTTTTTCAACGTGGCGACCAGTGGCCCTGACCCTG 
CTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTA 
CTACCAGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATA 
CGTCCCAAGAGTTGCAATTTNTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTNTGCAGCAT 
GTGGCTGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGGAACTTTGAAGGAGGGCAA 
AGTNTCCTCATNTACTATACACACACCACTTCCC 
232/330
FIGURE 232 

GCCGAGCGCAAGAACCCTGCGCAGCCCAGAGCAGCTGCTGGAGGGGAATCGAGGCGCGGCTC 
CGGGGATTCGGCTCGGGCCGCTGGCTCTGCTCTGCGGGGAGGGAGCGGGCCCGCCCGCGGGG 
CCCGAGCCCTCCGGATCCGCCCCCTCCCCGGTCCCGCCCCCTCGGAGACTCCTCTGGCTGCT 
CTGGGGGTTCGCCGGGGCCGGGGACCCGCGGTCCGGGCGCCATGCGGGCATCGCTGCTGCTG
TCGGTGCTGCGGCCCGCAGGGCCCGTGGCCGTGGGCATCTCCCTGGGCTTCACCCTGAGCCT 
GCTCAGCGTCACCTGGGTGGAGGAGCCGTGCGGCCCAGGCCCGCCCCAACCTGGAGACTCTG 
AGCTGCCGCCGCGCGGCAACACCAACGCGGCGCGCCGGCCCAACTCGGTGCAGCCCGGAGCG 
GAGCGCGAGAAGCCCGGGGCCGGCGAAGGCGCCGGGGAGAATTGGGAGCCGCGCGTCTTGCC 
CTACCACCCTGCACAGCCCGGCCAGGCCGCCAAAAAGGCCGTCAGGACCCGCTACATCAGCA 
CGGAGCTGGGCATCAGGCAGAGGCTGCTGGTGGCGGTGCTGACCTCTCAGACCACGCTGCCC 
ACGCTGGGCGTGGCCGTGAACCGCACGCTGGGGCACCGGCTGGAGCGTGTGGTGTTCCTGAC 
GGGCGCACGGGGCCGCCGGGCCCCACCTGGCATGGCAGTGGTGACGCTGGGCGAGGAGCGAC 
CCATTGGACACCTGCACCTGGCGCTGCGCCACCTGCTGGAGCAGCACGGCGACGACTTTGAC 
TGGTTCTTCCTGGTGCCTGACACCACCTACACCGAGGCGCACGGCCTGGCACGCCTAACTGG 
CCACCTCAGCCTGGCCTCCGCCGCCCACCTGTACCTGGGCCGGCCCCAGGACTTCATCGGCG 
GAGAGCCCACCCCCGGCCGCTACTGCCACGGAGGCTTTGGGGTGCTGCTGTCGCGCATGCTG 
CTGCAACAACTGCGCCCCCACCTGGAAGGCTGCCGCAACGACATCGTCAGTGCGCGCCCTGA 
CGAGTGGCTGGGTCGCTGCATTCTCGATGCCACCGGGGTGGGCTGCACTGGTGACCACGAGG 
GGGTGCACTATAGCCATCTGGAGCTGAGCCCTGGGGAGCCAGTGCAGGAGGGGGACCCTCAT 
TTCCGAAGTGCCCTGACAGCCCACCCTGTGCGTGACCCTGTGCACATGTACCAGCTGCACAA 
AGCTTTCGCCCGAGCTGAACTGGAACGCACGTACCAGGAGATCCAGGAGTTACAGTGGGAGA 
TCCAGAATACCAGCCATCTGGCCGTTGATGGGGACCGGGCAGCTGCTTGGCCCGTGGGTATT 
CCAGCACCATCCCGCCCGGCCTCCCGCTTTGAGGTGCTGCGCTGGGACTACTTCACGGAGCA 
GCACGCTTTCTCCTGCGCCGATGGCTCACCCCGCTGCCCACTGCGTGGGGCTGACCGGGCTG 
ATGTGGCCGATGTTCTGGGGACAGCTCTAGAGGAGCTGAACCGCCGCTACCACCCGGCCTTG 
CGGCTCCAGAAGCAGCAGCTGGTGAATGGCTACCGACGCTTTGATCCGGCCCGGGGTATGGA 
ATACACGCTGGACTTGCAGCTGGAGGCACTGACCCCCCAGGGAGGCCGCCGGCCCCTCACTC 
GCCGAGTGCAGCTGCTCCGGCCGCTGAGCCGCGTGGAGATCTTGCCTGTGCCCTATGTCACT 
GAGGCCTCACGTCTCACTGTGCTGCTGCCTCTAGCTGCGGCTGAGCGTGACCTGGCCCCTGG 
CTTCTTGGAGGCCTTTGCCACTGCAGCACTGGAGCCTGGTGATGCTGCGGCAGCCCTGACCC 
TGCTGCTACTGTATGAGCCGCGCCAGGCCCAGCGCGTGGCCCATGCAGATGTCTTCGCACCT 
GTCAAGGCCCACGTGGCAGAGCTGGAGCGGCGTTTCCCCGGTGCCCGGGTGCCATGGCTCAG 
TGTGCAGACAGCCGCACCCTCACCACTGCGCCTCATGGATCTACTCTCCAAGAAGCACCCGC 
TGGACACACTGTTCCTGCTGGCCGGGCCAGACACGGTGCTCACGCCTGACTTCCTGAACCGC 
TGCCGCATGCATGCCATCTCCGGCTGGCAGGCCTTCTTTCCCATGCATTTCCAAGCCTTCCA 
CCCAGGTGTGGCCCCACCACAAGGGCCTGGGCCCCCAGAGCTGGGCCGTGACACTGGCCGCT 
TTGATCGCCAGGCAGCCAGCGAGGCCTGCTTCTACAACTCCGACTACGTGGCAGCCCGTGGG 
CGCCTGGCGGCAGCCTCAGAACAAGAAGAGGAGCTGCTGGAGAGCCTGGATGTGTACGAGCT 
GTTCCTCCACTTCTCCAGTCTGCATGTGCTGCGGGCGGTGGAGCCGGCGCTGCTGCAGCGCT 
ACCGGGCCCAGACGTGCAGCGCGAGGCTCAGTGAGGACCTGTACCACCGCTGCCTCCAGAGC 
GTGCTTGAGGGCCTCGGCTCCCGAACCCAGCTGGCCATGCTACTCTTTGAACAGGAGCAGGG 
CAACAGCACCTGACCCCACCCTGTCCCCGTGGGCCGTGGCATGGCCACACCCCACCCCACTT 
CTCCCCCAAAACCAGAGCCACCTGCCAGCCTCGCTGGGCAGGGCTGGCCGTAGCCAGACCCC 
AAGCTGGCCCACTGGTCCCCTCTCTGGCTCTGTGGGTCCCTGGGCTCTGGACAAGCACTGGG 
GGACGTGCCCCCAGAGCCACCCACTTCTCATCCCAAACCCAGTTTCCCTGCCCCCTGACGCT 
GCTGATTCGGGCTGTGGCCTCCACGTATTTATGCAGTACAGTCTGCCTGACGCCAGCCCTGC 
CTCTGGGCCCTGGGGGCTGGGCTGTAGAAGAGTTGTTGGGGAAGGAGGGAGCTGAGGAGGGG 
GCATCTCCCAACTTCTCCCTTTTGGACCCTGCCGAAGCTCCCTGCCTTTAATAAACTGGCCA 
AGTGTGGAAAAA
233/330
FIGURE 233 

MRASLLLSVLRPAGPVAVGISLGFTLSLLSVTWVEEPCGPGPPQPGDSELPPRGNTNAARRP 
NSVQPGAEREKPGAGEGAGENWEPRVLPYHPAQPGQAAKKAVRTRYISTELGIRQRLLVAVL 
TSQTTLPTLGVAVNRTLGHRLERVVFLTGARGRRAPPGMAVVTLGEERPIGHLHLALRHLLE
QHGDDFDWFFLVPDTTYTEAHGLARLTGHLSLASAAHLYLGRPQDFIGGEPTPGRYCHGGFG 
VLLSRMLLQQLRPHLEGCRNDIVSARPDEWLGRCILDATGVGCTGDHEGVHYSHLELSPGEP 
VQEGDPHFRSALTAHPVRDPVHMYQLHKAFARAELERTYQEIQELQWEIQNTSHLAVDGDRA 
AAWPVGIPAPSRPASRFEVLRWDYFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELN 
RRYHPALRLQKQQLVNGYRRFDPARGMEYTLDLQLEALTPQGGRRPLTRRVQLLRPLSRVEI 
LPVPYVTEASRLTVLLPLAAAERDLAPGFLEAFATAALEPGDAAAALTLLLLYEPRQAQRVA 
HADVFAPVKAHVAELERRFPGARVPWLSVQTAAPSPLRLMDLLSKKHPLDTLFLLAGPDTVL 
TPDFLNRCRMHAISGWQAFFPMHFQAFHPGVAPPQGPGPPELGRDTGRFDRQAASEACFYNS 
DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLHVLRAVEPALLQRYRAQTCSARLSEDL 
YHRCLQSVLEGLGSRTQLAMLLFEQEQGNST 
234/330
FIGURE 234 

GCTCTGGCCGGCCCCGGCGATTGGTCACCGCCCGCTAGGGGACAGCCCTGGCCTCCTCTGAT 
TGGCAAGCGCTGGCCACCTCCCCACACCCCTTGCGAACGCTCCCCTAGTGGAGAAAAGGAGT 
AGCTATTAGCCAATTCGGCAGGGCCCGCTTTTTAGAAGCTTGATTTCCTTTGAAGATGAAAG 
ACTAGCGGAAGCTCTGCCTCTTTCCCCAGTGGGCGAGGGAACTCGGGGCGATTGGCTGGGAA 
CTGTATCCACCCAAATGTCACCGATTTCTTCCTATGCAGGAAATGAGCAGACCCATCAATAA 
GAAATTTCTCAGCCTGGCCGAAAATGGTTGGCCCCACGAAGCCACGACAACTGGAGGCAAAG 
AGGGTTGCTCAACGCCCCGCCTCATTGGAAAACCAAATCAGATCTGGGACCTATATAGCGTG 
GCGGAGGCGGGGCGATGATTGTCGCGCTCGCACCCACTGCAGCTGCGCACAGTCGCATTTCT 
TTCCCCGCCCCTGAGACCCTGCAGCACCATCTGTCATGGCGGCTGGGCTGTTTGGTTTGAGC 
GCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGCTCCCGGCCGCCCGCGTCCGCTGGGA 
ATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGTGGCGGGAAAGCGGCCCCCAGAAC 
CGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGACGAAAACTTGTATGAGAAGAACCCA 
GACTCCCATGGTTATGACAAGGACCCCGTTTTGGACGTCTGGAACATGCGACTTGTCTTCTT 
CTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACCTTTGTGGCCTATCTGCCTGACTACA 
GGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCTTGTGAAATACCGAGAGGCCAATGGC 
CTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCAAGATCCAGCTGCCAGAGGATGAGTG 
ACCAGTTGCTAAGTGGGGCTCAAGAAGCACCGCCTTCCCCACCCCCTGCCTGCCATTCTGAC 
CTCTTCTCAGAGCACCTAATTAAAGGGGCTGAAAGTCTGAA 
235/330
FIGURE 235 

MAAGLFGLSARRLLAAAATRGLPAARVRWESSFSRTVVAPSAVAGKRPPEPTTPWQEDPEPE
DENLYEKNPDSHGYDKDPVLDVWNMRLVFFFGVSIILVLGSTFVAYLPDYRMKEWSRREAER 
LVKYREANGLPIMESNCFDPSKIQLPEDE 
236/330 
FIGURE 236 

GGCGGCTGGGCTGTTTGGTTTGAGCGCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGC 
TCCCGGCCGCCCGCGTCCGCTGGGAATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCT 
GTGGCGGGAAAGCGGCCCCCAGAACCGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGA 
CGAAAACTTGTATGAGAAGAACCCAGACTCCCATGGTTATGACAAGGACCCCGTTTTGGACG 
TCTGGAACATGCGACTTGTCTTCTTCTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACC 
TTTGTGGCCTATCTGCCTGACTACAGGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCT 
TGTGAAATACCGAGAGGCCAATGGCCTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCA 
AGATCCAG 
237/330
FIGURE 237 

GCGGCGGCTATGCCGCTTGCTCTGCTCGTCCTGTTGCTCCTGGGGCCCGGCGGCTGGTGCCT
TGCAGAACCCCCACGCGACAGCCTGCGGGAGGAACTTGTCATCACCCCGCTGCCTTCCGGGG 
ACGTAGCCGCCACATTCCAGTTCCGCACGCGCTGGGATTCGGAGCTTCAGCGGGAAGGAGTG 
TCCCATTACAGGCTCTTTCCCAAAGCCCTGGGGCAGCTGATCTCCAAGTATTCTCTACGGGA 
GCTGCACCTGTCATTCACACAAGGCTTTTGGAGGACCCGATACTGGGGGCCACCCTTCCTGC 
AGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTTCCAAGACACTGTCACTGATGTGGATAAA 
TCTTGGAAGGAGCTCAGTAATGTCCTCTCAGGGATCTTCTGCGCCTCTCTCAACTTCATCGA 
CTCCACCAACACAGTCACTCCCACTGCCTCCTTCAAACCCCTGGGTCTGGCCAATGACACTG 
ACCACTACTTTCTGCGCTATGCTGTGCTGCCGCGGGAGGTGGTCTGCACCGAAAACCTCACC 
CCCTGGAAGAAGCTCTTGCCCTGTAGTTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGA 
TCGCTTGTTCCACACCAGCTACCACTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATG 
CACGCTGTACTAGCATCTCCTGGGAGCTGAGGCAGACCCTGTCAGTTGTATTTGATGCCTTC 
ATCACGGGGCAGGGAAAGAAAGACTGGTCCCTCTTCCGGATGTTCTCCCGAACCCTCACGGA 
GCCCTGCCCCCTGGCTTCAGAGAGCCGAGTCTATGTGGACATCACCACCTACAACCAGGACA 
ACGAGACATTAGAGGTGCACCCACCCCCGACCACTACATATCAGGACGTCATCCTAGGCACT 
CGGAAGACCTATGCCATCTATGACTTGCTTGACACCGCCATGATCAACAACTCTCGAAACCT 
CAACATCCAGCTCAAGTGGAAGAGACCCCCAGAGAATGAGGCCCCCCCAGTGCCCTTCCTGC 
ATGCCCAGCGGTACGTGAGTGGCTATGGGCTGCAGAAGGGGGAGCTGAGCACACTGCTGTAC 
AACACCCACCCATACCGGGCCTTCCCGGTGCTGCTGCTGGACACCGTACCCTGGTATCTGCG 
GCTGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAACAAACCAAGTTACATCC 
ACTACCAGCCTGCCCAGGACCGGCTGCAACCCCACCTCCTGGAGATGCTGATTCAGCTGCCG 
GCCAACTCAGTCACCAAGGTTTCCATCCAGTTTGAGCGGGCGCTGCTGAAGTGGACCGAGTA 
CACGCCAGATCCTAACCATGGCTTCTATGTCAGCCCATCTGTCCTCAGCGCCCTTGTGCCCA 
GCATGGTAGCAGCCAAGCCAGTGGACTGGGAAGAGAGTCCCCTCTTCAACAGCCTGTTCCCA 
GTCTCTGATGGCTCTAACTACTTTGTGCGGCTCTACACGGAGCCGCTGCTGGTGAACCTGCC 
GACACCGGACTTCAGCATGCCCTACAACGTGATCTGCCTCACGTGCACTGTGGTGGCCGTGT 
GCTACGGCTCCTTCTACAATCTCCTCACCCGAACCTTCCACATCGAGGAGCCCCGCACAGGT 
GGCCTGGCCAAGCGGCTGGCCAACCTTATCCGGCGCGCCCGAGGTGTCCCCCCACTCTGATT 
CTTGCCCTTTCCAGCAGCTGCAGCTGCCGTTTCTCTCTGGGGAGGGGAGCCCAAGGGCTGTT 
TCTGCCACTTGCTCTCCTCAGAGTTGGCTTTTGAACCAAAGTGCCCTGGACCAGGTCAGGGC 
CTACAGCTGTGTTGTCCAGTACAGGAGCCACGAGCCAAATGTGGCATTTGAATTTGAATTAA 
CTTAGAAATTCATTTCCTCACCTGTAGTGGCCACCTCTATATTGAGGTGCTCAATAAGCAAA 
AGTGGTCGGTGGCTGCTGTATTGGACAGCACAGAAAAAGATTTCCATCACCACAGAAAGGTC 
GGCTGGCAGCACTGGCCAAGGTGATGGGGTGTGCTACACAGTGTATGTCACTGTGTAGTGGA 
TGGAGTTTACTGTTTGTGGAATAAAAACGGCTGTTTCCGTGGAAAAAAAAAAAA 
238/330
FIGURE 238

MPLALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDVAATFQFRTRWDSELQREGVSHY 
RLFPKALGQLISKYSLRELHLSFTQGFWRTRYWGPPFLQAPSGAELWVWFQDTVTDVDKSWK 
ELSNVLSGIFCASLNFIDSTNTVTPTASFKPLGLANDTDHYFLRYAVLPREVVCTENLTPWK 
KLLPCSSKAGLSVLLKADRLFHTSYHSQAVHIRPVCRNARCTSISWELRQTLSVVFDAFITG 
QGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKT 
YAIYDLLDTAMINNSRNLNIQLKWKRPPENEAPPVPFLHAQRYVSGYGLQKGELSTLLYNTH 
PYRAFPVLLLDTVPWYLRLYVHTLTITSKGKENKPSYIHYQPAQDRLQPHLLEMLIQLPANS 
VTKVSIQFERALLKWTEYTPDPNHGFYVSPSVLSALVPSMVAAKPVDWEESPLFNSLFPVSD 
GSNYFVRLYTEPLLVNLPTPDFSMPYNVICLTCTVVAVCYGSFYNLLTRTFHIEEPRTGGLA
KRLANLIRRARGVPPL 
239/330
FIGURE 239 

CAACATGGGGTCCAGCAGCTTCTTGGTCCTCATGGTGTCTCTCGTTCTTGTGACCCTGGTGG
CTGTGGAAGGAGTTAAAGAGGGTATAGAGAAAGCAGGGGTTTGCCCAGCTGACAACGTACGC 
TGCTTCAAGTCCGATCCTCCCCAGTGTCACACAGACCAGGACTGTCTGGGGGAAAGGAAGTG 
TTGTTACCTGCACTGTGGCTTCAAGTGTGTGATTCCTGTGAAGGAACTGGAAGAAGGAGGAA 
ACAAGGATGAAGATGTGTCAAGGCCATACCCTGAGCCAGGATGGGAGGCCAAGTGTCCAGGC 
TCCTCCTCTACCAGGTGTCCTCAGAAATGATGCTGGGTCCTTTCTACCTCTGGGGGTCACTC
TCACTTGGCACCTGCCCCTGAGGGTCCTGAGACTTGGAATATGGAAGAAGCAATACCCAACC 
CCACCAAAGAAAACCTGAGCTTGAAGTCCTTTTCCCCAAAAAGAGGGAAGAGTCACAAAAAG 
TCCAGACCCCAGGGACGGTACTTTCCCTCTCTACCTGGTGCTCCTCCCTAATGCTCATGAAT 
GGACCCCTCATGAATGAAACCAGTGCCCTTATAAGAGACCCCAAAGAGCTGCCTTGCCCTTC 
TGCAATGTGTGATCACAGCTAGAAGGCACTGTCAGAGAAGAGAAACTGGTCCTCACCAGATG 
CTGAATCTGCTGGTGCCTTGATCTTGGACTTCCCAGCCTCTAGAACTGTAAGAAATAAATAT 
TTGCTGTTTATAATCCAA 
240/330
FIGURE 240 

MGSSSFLVLMVSLVLVTLVAVEGVKEGIEKAGVCPADNVRCFKSDPPQCHTDQDCLGERKCC 
YLHCGFKCVIPVKELEEGGNKDEDVSRPYPEPGWEAKCPGSSSTRCPQK 

Signal sequence: 

amino acids 1-19 

N-myristoylation sites:

amino acids 23-29, 27-33, 32-38, 102-108 

WAP-type 'four-disulfide core' domain signature:

amino acids 49-63 
241/330
FIGURE 241 

AAACTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGG 
AAACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAGAACCTTCC 
TCTAGAACCCGACCCACCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGCCAAGG 
CGTCCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTA 
TTAAGGAGCCTCAAACAAAGCCTTCCAGGCATCAACGCACAGAGAACATTAAAGAAAGGTCT 
CTACAGTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTA 
TGCAGAGCCAGCGCCAGAGAACAATGCCCTCAACACACAAACCCAGCCCAAGGCCCACACCA 
CCGGAGACAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCCAC 
ACAGCACAGAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTC
ACCCAGAGGGCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCC 
AGGACACAAAGACGACCCAAGGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACG 
GTGTCAGAGAAGCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAAAAGTCA 
GCACAGAATGCTGGCTCCCACAGGAGCAGTGTCAACAAGGACGAGACAGAAAGGAGTGACCA 
CAGCAGTCATCCCACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAG 
AGCCCCACGACGCAGAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTG 
GGATTTTGAGGAAAAATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTG 
TGAAGATCAAAGCCTCCAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTC 
TTCCTGGACTCCAGACACTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACC 
CTTTGGCTTCATGGAGCTCAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAG 
TGCCCCAGCAGCAGCTGCTCCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGT 
GCCGTGGTGGGCAACGGGGGCATCCTGAACAACTCCCACATGGGCCAGGAGATAGACAGTCA 
CGACTACGTGTTCCGATTGAGCGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTC 
GGACATCCTTCTACGGCTTTACCGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAAT 
CGGGGTTTCAAGAACGTGCCTCTTGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCAC 
CCGGGACTATGAGTGGCTGGAAGCACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTT 
TCTGGTTCAGGCACAGACCCCAGGAAGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTG 
TTGCTGCACCCAGACTTTCTCCGATACATGAAGAACAGGTTTCTGAGGTCTAAGACCCTGGA 
TGGTGCCCACTGGAGGATATACCGCCCCACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTC 
AGCTCTGTGACCAGGTGAGTGCTTATGGCTTCATCACTGAGGGCCATGAGCGCTTTTCTGAT 
CACTACTATGATACATCATGGAAGCGGCTGATCTTTTACATAAACCATGACTTCAAGCTGGA 
GAGAGAAGTCTGGAAGCGGCTACACGATGAAGGGATAATCCGGCTGTACCAGCGTCCTGGTC 
CCGGAACTGCCAAAGCCAAGAACTGACCGGGGCCAGGGCTGCCATGGTCTCCTTGCCTGCTC 
CAAGGCACAGGATACAGTGGGAATCTTGAGACTCTTTGGCCATTTCCCATGGCTCAGACTAA 
GCTCCAAGCCCTTCAGGAGTTCCAAGGGAACACTTGAACCATGGACAAGACTCTCTCAAGAT 
GGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCAGTACATTGCTGTAGGTCCTGAGGCCAGG 
GATTTTTAATTAAATGGGGTGATGGGTGGCCAATACCACAATTCCTGCTGAAAAACACTCTT 
CCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCCTGGATTTACAGAAACATATAGATCTG
GTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCTTGAAGGTATTACTTAACTTCACTAC 
AGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTCTAGAAGGGTCTATACTTGTCCTTG 
TCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAACTGATAATAATACAAATGATTGTT 
GTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAAAAAAAAAA 
242/330
FIGURE 242 

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQSLAKP 
KSQAPTRARRTTIYAEPAPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAW 
KSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQG 
KAATTAKTLIPKSQHRMLAPTGAVSTRTRQKGVTTAVIPPKEKKPQATPPPAPFQSPTTQRN 
QRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHF 
NQSEWDRLEHFAPPFGFMELNYSLVQKVVTRFPPVPQQQLLLASLPAGSLRCITCAVVGNGG
ILNNSHMGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVP 
LGKDVRYLHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYLLLHPDFL 
RYMKNRFLRSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSW 
KRLIFYINHDFKLEREVWKRLHDEGIIRLYQRPGPGTAKAKN 

Cytoplasmic Domain: 

amino acids 1-10 

Type II Transmembrane Domain: 

amino acids 11-35 

Lumenal catalytic Domain:

amino acids 36-600 

Ribonucleotide Reductase small subunit Signature: 

amino acids 481-496 

N-glycosylation Sites:

amino acids 300-303, 311-314, 331-334, 375-378, 460-463 
243/330
FIGURE 243 

CGATGCGCGGACCCGGGCACCCCCTCCTCCTGGGGCTGCTGCTGGTGCTGGGGCCTTCGCCG
GAGCAGCGAGTGGAAATTGTTCCTCGAGATCTGAGGATGAAGGACAAGTTTCTAAAACACCT
TACAGGCCCTCTTTATTTTAGTCCAAAGTGCAGCAAACACTTCCATAGACTTTATCACAACA 
CCAGAGACTGCACCATTCCTGCATACTATAAAAGATGCGCCAGGCTTCTTACCCGGCTGGCT 
GTCAGTCCAGTGTGCATGGAGGATAAGTGAGCAGACCGTACAGGAGCAGCACACCAGGAGCC 
ATGAGAAGTGCCTTGGAAACCAACAGGGAAACAGAACTATCTTTATACACATCCCCTCATGG 
ACAAGAGATTTATTTTTGCAGACAGACTCTTCCATAAGTCCTTTGAGTTTTGTATGTTGTTG 
ACAGTTTGCAGATATATATTCGATAAATCAGTGTACTTGACAGTGTTATCTGTCACTTATTT 
244/330
FIGURE 244 

MRGPGHPLLLGLLLVLGPSPEQRVEIVPRDLRMKDKFLKHLTGPLYFSPKCSKHFHRLYHNT 
RDCTIPAYYKRCARLLTRLAVSPVCMEDK 

245/330 
FIGURE 245 

GGGCTGGGCCCCGCCGCAGCTCCAGCTGGCCGGCTTGGTCCTGCGGTCCCTTCTCTGGGAGG 
CCCGACCCCGGCCGCGCCCAGCCCCCACCATGCCACCCGCGGGGCTCCGCCGGGCCGCGCCG
CTCACCGCAATCGCTCTGTTGGTGCTGGGGGCTCCCCTGGTGCTGGCCGGCGAGGACTGCCT 
GTGGTACCTGGACCGGAATGGCTCCTGGCATCCGGGGTTTAACTGCGAGTTCTTCACCTTCT 
GCTGCGGGACCTGCTACCATCGGTACTGCTGCAGGGACCTGACCTTGCTTATCACCGAGAGG 
CAGCAGAAGCACTGCCTGGCCTTCAGCCCCAAGACCATAGCAGGCATCGCCTCAGCTGTGAT 
CCTCTTTGTTGCTGTGGTTGCCACCACCATCTGCTGCTTCCTCTGTTCCTGTTGCTACCTGT 
ACCGCCGGCGCCAGCAGCTCCAGAGCCCATTTGAAGGCCAGGAGATTCCAATGACAGGCATC 
CCAGTGCAGCCAGTATACCCATACCCCCAGGACCCCAAAGCTGGCCCTGCACCCCCACAGCC 
TGGCTTCATGTACCCACCTAGTGGTCCTGCTCGCCAATATCCACTCTACCCAGCTGGGCCCC 
CAGTCTACAACCCTGCAGCTCCTCCTCCCTATATGCCACCACAGCCCTCTTACCCGGGAGCC 
TGAGGAACCAGCCATGTCTCTGCTGCCCCTTCAGTGATGCCAACCTTGGGAGATGCCCTCAT 
CCTGTACCTGCATCTGGTCCTGGGGGTGGCAGGAGTCCTCCAGCCACCAGGCCCCAGACCAA 
GCCAAGCCCTGGGCCCTACTGGGGACAGAGCCCCAGGGAAGTGGAACAGGAGCTGAACTAGA 
ACTATGAGGGGTTGGGGGGAGGGCTTGGAATTATGGGCTATTTTTACTGGGGGCAAGGGAGG 
GAGATGACAGCCTGGGTCACAGTGCCTGTTTTCAAATAGTCCCTCTGCTCCCAAGATCCCAG 
CCAGGAAGGCTGGGGCCCTACTGTTTGTCCCCTCTGGGCTGGGGTGGGGGGAGGGAGGAGGT 
TCCGTCAGCAGCTGGCAGTAGCCCTCCTCTCTGGCTGCCCCACTGGCCACATCTCTGGCCTG 
CTAGATTAAAGCTGTAAAGACAAAA 
246/330
FIGURE 246 

MPPAGLRRAAPLTAIALLVLGAPLVLAGEDCLWYLDRNGSWHPGFNCEFFTFCCGTCYHRYC 
CRDLTLLITERQQKHCLAFSPKTIAGIASAVILFVAVVATTICCFLCSCCYLYRRRQQLQSP
FEGQEIPMTGIPVQPVYPYPQDPKAGPAPPQPGFMYPPSGPAPQYPLYPAGPPVYNPAAPPP 
YMPPQPSYPGA 

Transmembrane Domains: 

amino acids 10-28, 85-110 

N-glycosylation Site: 

amino acids 38-41 



N-myristoylation Sites: 

amino acids 5-10, 88-93 
247/330
FIGURE 247

GGGGGAGCTAGGCCGGCGGCAGTGGTGGTGGCGGCGGCGCAAGGGTGAGGGCGGCCCCAGAA 
CCCCAGGTAGGTAGAGCAAGAAGATGGTGTTTCTGCCCCTCAAATGGTCCCTTGCAACCATG 
TCATTTCTACTTTCCTCACTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAG 
CACTGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGAATAAAATACGACTTC 
CTGAGTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACC 
TTCTGGGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCA 
TAGTCACCACCTGCAGATATCTAGGGCCACCCTCAGGAAGGGAGCTGGAGAGAGGCTATCGG 
AAGAACCCCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCTGGCTCCCGAG 
CCCCTCCTTGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGAC 
TTTCCACGGATTTTACAAAAGCACCTACAGAACCAAGGAAGGGGAACTGAGGATACTAGCAT 
CAACACAATTTGAACCCACTGCAGCTAGAATGGCCTTTCCCTGCTTTGATGAACCTGCCTTC 
AAAGCAAGTTTCTCAATCAAAATTAGAAGAGAGCCAAGGCACCTAGCCATCTCCAATATGCC 
ATTGGTGAAATCTGTGACTGTTGCTGAAGGACTCATAGAAGACCATTTTGATGTCACTGTGA 
AGATGAGCACCTATCTGGTGGCCTTCATCATTTCAGATTTTGAGTCTGTCAGCAAGATAACC 
AAGAGTGGAGTCAAGGTTTCTGTTTATGCTGTGCCAGACAAGATAAATCAAGCAGATTATGC 
ACTGGATGCTGCGGTGACTCTTCTAGAATTTTATGAGGATTATTTCAGCATACCGTATCCCC 
TACCCAAACAAGATCTTGCTGCTATTCCCGACTTTCAGTCTGGTGCTATGGAAAACTGGGGA
CTGACAACATATAGAGAATCTGCTCTGTTGTTTGATGCAGAAAAGTCTTCTGCATCAAGTAA
GCTTGGCATCACAGTGACTGTGGCCCATGAACTGGCCCACCAGTGGTTTGGGAACCTGGTCA 
CTATGGAATGGTGGAATGATCTTTGGCTAAATGAAGGATTTGCCAAATTTATGGAGTTTGTG 
TCTGTCAGTGTGACCCATCCTGAACTGAAAGTTGGAGATTATTTCTTTGGCAAATGTTTTGA 
CGCAATGGAGGTAGATGCTTTAAATTCCTCACACCCTGTGTCTACACCTGTGGAAAATCCTG 
CTCAGATCCGGGAGATGTTTGATGATGTTTCTTATGATAAGGGAGCTTGTATTCTGAATATG 
CTAAGGGAGTATCTTAGCGCTGACGCATTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCA 
TAGCTATAAAAATACAAAAAACGAGGACCTGTGGGATAGTATGGCAAGTATTTGCCCTACAG 
ATGGTGTAAAAGGGATGGATGGCTTTTGCTCTAGAAGTCAACATTCATCTTCATCCTCACAT 
TGGCATCAGGAAGGGGTGGATGTGAAAACCATGATGAACACTTGGACACTGCAGAGGGGTTT 
TCCCCTAATAACCATCACAGTGAGGGGGAGGAATGTACACATGAAGCAAGAGCACTACATGA 
AGGGCTCTGACGGCGCCCCGGACACTGGGTACCTGTGGCATGTTCCATTGACATTCATCACC 
AGCAAATCCAACATGGTCCATCGATTTTTGCTAAAAACAAAAACAGATGTGCTCATCCTCCC 
AGAAGAGGTGGAATGGATCAAATTTAATGTGGGCATGAATGGCTATTACATTGTGCATTACG 
AGGATGATGGATGGGACTCTTTGACTGGCCTTTTAAAAGGAACACACACAGCAGTCAGCAGT 
AATGATCGGGCAAGTCTCATTAACAATGCATTTCAGCTCGTCAGCATTGGGAAGCTGTCCAT 
TGAAAAGGCCTTGGATTTATCCCTGTACTTGAAACATGAAACTGAAATTATGCCCGTGTTTC 
AAGGTTTGAATGAGCTGATTCCTATGTATAAGTTAATGGAGAAAAGAGATATGAATGAAGTG 
GAAACTCAATTCAAGGCCTTCCTCATCAGGCTGCTAAGGGACCTCATTGATAAGCAGACATG 
GACAGACGAGGGCTCAGTCTCAGAGCAAATGCTGCGGAGTGAACTACTACTCCTCGCCTGTG 
TGCACAACTATCAGCCGTGCGTACAGAGGGCAGAAGGCTATTTCAGAAAGTGGAAGGAATCC 
AATGGAAACTTGAGCCTGCCTGTCGACGTGACCTTGGCAGTGTTTGCTGTGGGGGCCCAGAG 
CACAGAAGGCTGGGATTTTCTTTATAGTAAATATCAGTTTTCTTTGTCCAGTACTGAGAAAA 
GCCAAATTGAATTTGCCCTCTGCAGAACCCAAAATAAGGAAAAGCTTCAATGGCTACTAGAT 
GAAAGCTTTAAGGGAGATAAAATAAAAACTCAGGAGTTTCCACAAATTCTTACACTCATTGG 
CAGGAACCCAGTAGGATACCCACTGGCCTGGCAATTTCTGAGGAAAAACTGGAACAAACTTG
TACAAAAGTTTGAACTTGGCTCATCTTCCATAGCCCACATGGTAATGGGTACAACAAATCAA 
TTCTCCACAAGAACACGGCTTGAAGAGGTAAAAGGATTCTTCAGCTCTTTGAAAGAAAATGG 
TTCTCAGCTCCGTTGTGTCCAACAGACAATTGAAACCATTGAAGAAAACATCGGTTGGATGG 
ATAAGAATTTTGATAAAATCAGAGTGTGGCTGCAAAGTGAAAAGCTTGAACGTATGTAAAAA 
TTCCTCCCTTGCCCGGTTCCTGTTATCTCTAATCACCAACATTTTGTTGAGTGTATTTTCAA 
ACTAGAGATGGCTGTTTTGGCTCCAACTGGAGATACTTTTTTCCCTTCAACTCATTTTTTGA 
CTATCCCTGTGAAAAGAATAGCTGTTAGTTTTTCATGAATGGGCTTTTTCATGAATGGGCTA 
TCGCTACCATGTGTTTTGTTCATCACAGGTGTTGCCCTGCAACGTAAACCCAAGTGTTGGGT 
TCCCTGCCACAGAAGAATAAAGTACCTTATTCTTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
248/330
FIGURE 248 

MVFLPLKWSLATMSFLLSSLLALLTVSTPSWCQSTEASPKRSDGTPFPWNKIRLPEYVIPVH 
YDLLIHANLTTLTFWGTTKVEITASQPTSTIILHSHHLQISRATLRKGAGERLSEEPLQVLE 
HPPQEQIALLAPEPLLVGLPYTVVIHYAGNLSETFHGFYKSTYRTKEGELRILASTQFEPTA
ARMAFPCFDEPAFKASFSIKIRREPRHLAISNMPLVKSVTVAEGLIEDHFDVTVKMSTYLVA 
FIISDFESVSKITKSGVKVSVYAVPDKINQADYALDAAVTLLEFYEDYFSIPYPLPKQDLAA 
IPDFQSGAMENWGLTTYRESALLFDAEKSSASSKLGITVTVAHELAHQWFGNLVTMEWWNDL 
WLNEGFAKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFD 
DVSYDKGACILNMLREYLSADAFKSGIVQYLQKHSYKNTKNEDLWDSMASICPTDGVKGMDG 
FCSRSQHSSSSSHWHQEGVDVKTMMNTWTLQRGFPLITITVRGRNVHMKQEHYMKGSDGAPD 
TGYLWHVPLTFITSKSNMVHRFLLKTKTDVLILPEEVEWIKFNVGMNGYYIVHYEDDGWDSL 
TGLLKGTHTAVSSNDRASLINNAFQLVSIGKLSIEKALDLSLYLKHETEIMPVFQGLNELIP 
MYKLMEKRDMNEVETQFKAFLIRLLRDLIDKQTWTDEGSVSEQMLRSELLLLACVHNYQPCV 
QRAEGYFRKWKESNGNLSLPVDVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFALC 
RTQNKEKLQWLLDESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNKLVQKFELGS 
SSIAHMVMGTTNQFSTRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIR 
VWLQSEKLERM 

Signal peptide: 

amino acids 1-34 

N-glycosylation sites:

amino acids 70-74, 154-158, 414-418, 760-764, 901-905 

Neutral zinc metallopeptidases, zinc-binding region signature:

amino acids 350-360 
249/330
FIGURE 249 

CAGCCACAGACGGGTCATGAGCGCGGTATTACTGCTGGCCCTCCTGGGGTTCATCCTCCCAC
TGCCAGGAGTGCAGGCGCTGCTCTGCCAGTTTGGGACAGTTCAGCATGTGTGGAAGGTGTCC 
GACCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCTTGGGGTGCCAGGA 
CACGTTGATGCTCATTGAGAGCGGACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGG 
AGGCCAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATC 
TCCTACACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCT 
TTGGGCCCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCCAGTCTGCTTGTCTATGG 
AAGGCTGTCTGGAGGGGACAACAGAAGAGATCTGCCCCAAGGGGACCACACACTGTTATGAT 
GGCCTCCTCAGGCTCAGGGGAGGAGGCATCTTCTCCAATCTGAGAGTCCAGGGATGCATGCC 
CCAGCCAGGTTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGA 
ACTGCAATAGGAAAGATTTTCTGACCTGTCATCGGGGGACCACCATTATGACACACGGAAAC
TTGGCTCAAGAACCCACTGATTGGACCACATCGAATACCGAGATGTGCGAGGTGGGGCAGGT 
GTGTCAGGAGACGCTGCTGCTCATAGATGTAGGACTCACATCAACCCTGGTGGGGACAAAAG 
GCTGCAGCACTGTTGGGGCTCAAAATTCCCAGAAGACCACCATCCACTCAGCCCCTCCTGGG 
GTGCTTGTGGCCTCCTATACCCACTTCTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAG 
CAGCGTTCTGCTGAACTCCCTCCCTCCTCAAGCTGCCCCTGTCCCAGGAGACCGGCAGTGTC 
CTACCTGTGTGCAGCCCCTTGGAACCTGTTCAAGTGGCTCCCCCCGAATGACCTGCCCCAGG 
GGCGCCACTCATTGTTATGATGGGTACATTCATCTCTCAGGAGGTGGGCTGTCCACCAAAAT 
GAGCATTCAGGGCTGCGTGGCCCAACCTTCCAGCTTCTTGTTGAACCACACCAGACAAATCG 
GGATCTTCTCTGCGCGTGAGAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGT 
GGGGCTGAGGGCCTGGAGTCTCTCACTTGGGGGGTGGGGCTGGCACTGGCCCCAGCGCTGTG 
GTGGGGAGTGGTTTGCCCTTCCTGCTAACTCTATTACCCCCACGATTCTTCACCGCTGCTGA 
CCACCCACACTCAACCTCCCTCTGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTC 
CCATTCTGTCCATGAATCATCTTCCCCACACACAATCATTCATATCTACTCACCTAACAGCA 
ACACTGGGGAGAGCCTGGAGCATCCGGACTTGCCCTATGGGAGAGGGGACGCTGGAGGAGTG 
GCTGCATGTATCTGATAATACAGACCCTGTCCTTTCA 
250/330
FIGURE 250 

MSAVLLLALLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLMLI 
ESGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQP 
PADPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRGGGIFSNLRVQGCMPQPGCN 
LLNGTQEIGPVGMTENCNRKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEVGQVCQETL 
LLIDVGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLN 
SLPPQAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGC 
VAQPSSFLLNHTRQIGIFSAREKRDVQPPASQHEGGGAEGLESLTWGVGLALAPALWWGVVC
PSC 
251/330
FIGURE 251 

GCGACGGGCAGGACGCCCCGTTCGCCTAGCGCGTGCTCAGGAGTTGGTGTCCTGCCTGCGCT 
CAGGATGAGGGGGAATCTGGCCCTGGTGGGCGTTCTAATCAGCCTGGCCTTCCTGTCACTGCTG
CCATCTGGACATCCTCAGCCGGCTGGCGATGACGCCTGCTCTGTGCAGATCCTCGTCCCTGG 
CCTCAAAGGGGATGCGGGAGAGAAGGGAGACAAAGGCGCCCCCGGACGGCCTGGAAGAGTCG 
GCCCCACGGGAGAAAAAGGAGACATGGGGGACAAAGGACAGAAAGGCAGTGTGGGTCGTCAT
GGAAAAATTGGTCCCATTGGCTCTAAAGGTGAGAAAGGAGATTCCGGTGACATAGGACCCCC 
TGGTCCTAATGGAGAACCAGGCCTCCCATGTGAGTGCAGCCAGCTGCGCAAGGCCATCGGGG 
AGATGGACAACCAGGTCTCTCAGCTGACCAGCGAGCTCAAGTTCATCAAGAATGCTGTCGCC 
GGTGTGCGCGAGACGGAGAGCAAGATCTACCTGCTGGTGAAGGAGGAGAAGCGCTACGCGGA
CGCCCAGCTGTCCTGCCAGGGCCGCGGGGGCACGCTGAGCATGCCCAAGGACGAGGCTGCCA 
ATGGCCTGATGGCCGCATACCTGGCGCAAGCCGGCCTGGCCCGTGTCTTCATCGGCATCAAC 
GACCTGGAGAAGGAGGGCGCCTTCGTGTACTCTGACCACTCCCCCATGCGGACCTTCAACAA 
GTGGCGCAGCGGTGAGCCCAACAATGCCTACGACGAGGAGGACTGCGTGGAGATGGTGGCCT 
CGGGCGGCTGGAACGACGTGGCCTGCCACACCACCATGTACTTCATGTGTGAGTTTGACAAG 
GAGAACATGTGAGCCTCAGGCTGGGGCTGCCCATTGGGGGCCCCACATGTCCCTGCAGGGTT 
GGCAGGGACAGAGCCCAGACCATGGTGCCAGCCAGGGAGCTGTCCCTCTGTGAAGGGTGGAG 
GCTCACTGAGTAGAGGGCTGTTGTCTAAACTGAGAAAATGGCCTATGCTTAAGAGGAAAATG
AAAGTGTTCCTGGGGTGCTGTCTCTGAAGAAGCAGAGTTTCATTACCTGTATTGTAGCCCCA 
ATGTCATTATGTAATTATTACCCAGAATTGCTCTTCCATAAAGCTTGTGCCTTTGTCCAAGC 
TATACAATAAAATCTTTAAGTAGTGCAGTAGTTAAGTCCAAAAAAAAAAAAAAAAAAA 
252/330
FIGURE 252 

MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGRVG 
PTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGE 
MDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAAN 
GLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVAS 
GGWNDVACHTTMYFMCEFDKENM 
253/330
FIGURE 253 

AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCGGTTTCTCTCTTTGCAGGAGCACCGGCAG 
CACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAGCCAGTTCCTTGATCCTGCCAGACCACC 
CAGCCCCCGGCACAGAGCTGCTCCACAGGCACCATGAGGATCATGCTGCTATTCACAGCCAT 
CCTGGCCTTCAGCCTAGCTCAGAGCTTTGGGGCTGTCTGTAAGGAGCCACAGGAGGAGGTGG 
TTCCTGGCGGGGGCCGCAGCAAGAGGGATCCAGATCTCTACCAGCTGCTCCAGAGACTCTTC 
AAAAGCCACTCATCTCTGGAGGGATTGCTCAAAGCCCTGAGCCAGGCTAGCACAGATCCTAA 
GGAATCAACATCTCCCGAGAAACGTGACATGCATGACTTCTTTGTGGGACTTATGGGCAAGA 
GGAGCGTCCAGCCAGAGGGAAAGACAGGACCTTTCTTACCTTCAGTGAGGGTTCCTCGGCCC 
CTTCATCCCAATCAGCTTGGATCCACAGGAAAGTCTTCCCTGGGAACAGAGGAGCAGAGACC 
TTTATAAGACTCTCCTACGGATGTGAATCAAGAGAACGTCCCCAGCTTTGGCATCCTCAAGT 
ATCCCCCGAGAGCAGAATAGGTACTCCACTTCCGGACTCCTGGACTGCATTAGGAAGACCTC 
TTTCCCTGTCCCAATCCCCAGGTGCGCACGCTCCTGTTACCCTTTCTCTTCCCTGTTCTTGT 
AACATTCTTGTGCTTTGACTCCTTCTCCATCTTTTCTACCTGACCCTGGTGTGGAAACTGCA 
TAGTGAATATCCCCAACCCCAATGGGCATTGACTGTAGAATACCCTAGAGTTCCTGTAGTGT 
CCTACATTAAAAATATAATGTCTCTCTCTATTCCTCAACAATAAAGGATTTTTGCATATGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
254/330
FIGURE 254 

MRIMLLFTAILAFSLAQSFGAVCKEPQEEVVPGGGRSKRDPDLYQLLQRLFKSHSSLEGLLK
ALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFLPSVRVPRPLHPNQLGSTGK 
SSLGTEEQRPL 

Important features: 
Signal peptide: 

amino acids 1-18 

Tyrosine kinase phosphorylation site. 

amino acids 36-45 

N-myristoylation site.

amino acids 33-39, 59-65

Amidation site.

amino acids 90-94 

Leucine zipper pattern. 

amino acids 43-65 



Tachykinin family signature. 

amino acids 86-92 
255/330
FIGURE 255 

GGGCGTCTCCGGCTGCTCCTATTGAGCTGTCTGCTCGCTGTGCCCGCTGTGCCTGCTGTGCC 
CGCGCTGTCGCCGCTGCTACCGCGTCTGCTGGACGCGGGAGACGCCAGCGAGCTGGTGATTG 
GAGCCCTGCGGAGAGCTCAAGCGCCCAGCTCTGCCCCAGGAGCCCAGGCTGCCCCGTGAGTC 
CCATAGTTGCTGCAGGAGTGGAGCCATGAGCTGCGTCCTGGGTGGTGTCATCCCCTTGGGGC
TGCTGTTCCTGGTCTGCGGATCCCAAGGCTACCTCCTGCCCAACGTCACTCTCTTAGAGGAG 
CTGCTCAGCAAATACCAGCACAACGAGTCTCACTCCCGGGTCCGCAGAGCCATCCCCAGGGA 
GGACAAGGAGGAGATCCTCATGCTGCACAACAAGCTTCGGGGCCAGGTGCAGCCTCAGGCCT 
CCAACATGGAGTACATGGTGAGCGCCGGCTCCGGCCGCAGAGGCTGGCACCGGGGGTGGGGC 
CTGGGCCACCAGCCTGCTCTGTTCCCCAGCCAGCTCTGTTCCCCAGCCAGTGCGTGTGATGG 
CTGGCTCAGGGTCTCCTCTGGCAGGGGAGGATCCCGGCTCTGTTCTGTTTTGTTTGTTTGTT 
TTGAGACAGGGTCTCACTCTGCCACTGACGCTGGAGTGCAATGGCACAATCGTCATGCCCTG 
AAACCTTAGACTCCCGGGGTTAAGCGATCCTGCTTCAGCCTCCCAAGTAGCTGGAACTACAG 
GCATGCACCATGGTGCCCAGCTAGATTTTAAATATTTTGTGGAGATGGGGGTCTTGCTACGT 
TGCCCAGGCTGGTCTTGAACTCCTAGGCTCAAGCAATCCTCCTGCCTCAGCCTCTCAAAGTG 
CTAGGATTATAGGCATGAGTCACCCTGTCTGGCTCTGGCTCTGTTCTTAACATTCTGCCAAA 
ACAACACACGTGGGTTCCCTGTGCAGAGCCTGCCTCGTTGCCTTCATGTCACTCTTGGTAGC
TCCACTGGGAACACAGCTCTCAGCCTTTCCCACCTGGAGGCAGAGTGGGGAGGGGCCCAGGG 
CTGGGCTTTGCTGATGCTGATCTCAGCTGTGCCACACGCTAGCTGCACCACCCTGACTTCTC 
CTTAGCCCGTGTGAGCCTCACTTTCCACTTGGAGAGTCCTTCCTCGCGTGGTTGCCATGACT 
GTGAGATAAGTCGAGGCTGTGAAGGGCCCGGCACAGACTGACCTGCCTCCCCAACCCCTAGG 
CTTTGCTAACCGGGAAAGGAGCTAACGGTGACAGAAGACAGCCAAGGTCAACCCTCCCGGGT 
GATTGTGATGGGTGTTCCAGGTGTGGTTGGGCGATGCTGCTACTTGACCCCAAGCTCCAGTG 
TGGAAACTTCCTTCCTGGCTGGTTTTCCAGAACTACAGAGGAATGGACCACAGTCTTCCAGG 
GTCCCTCCTCGTCCACCAACCGGGAGCCTCCACCTTGGCCATCCGTCAGCTATGAATGGCTT 
TTTAAACAAACCCACGTCCCAGCCTGGGTAACATGGTAAAGCCCCGTCTCTACAAAAAAATC 
CAAGTTAGCCGGGCATGGTGGTGCGCACCTGTAGTCCCAGCTGCAGTGGGACTGAGGTGGAG 
GTGGAGGTGGGGGGTGGGAGCTGAGGAAGGAGGATCGCTTGAGCCTGGGAAGTCGAGGCTGC 
AGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCAAAAA
256/330



FIGURE 256 



MSCVLGGVIPLGLLFLVCGSQGYLLPNVTLLEELLSKYQHNESHSRVRRAIPREDKEEILML 
HNKLRGQVQPQASNMEYMVSAGSGRRGWHRGWGLGHQPALFPSQLCSPASACDGWLRVSSGR 
GGSRLCSVLFVCFETGSHSATDAGVQWHNRHALKP 

Important features: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 27-31, 41-45 

N-myristoylation site. 

amino acids 126-132, 140-146 

Amidation site. 

amino acids 85-89 
257/330
FIGURE 257 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGG 
GTCTGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGG 
GTATGACTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGC 
TTCCTGGAATTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCAC 
CCTCCACCATGCAAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTG
TGTCCTTCTTGGCCCGGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTT 
CAGCAGGCCCCCACCCTCCTGAGTGGCAATAAATAAAATTCGGTATGCTG 
258/330
FIGURE 258 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTS 
VTLHHARSQHHVVCNT
259/330
FIGURE 259

AATTGTATCTGTGTAATGTTAAAACAAACGAAATAAAATAGAAGGAAAAACTTTCTGAGTTT 
CAAAAACAACAGACTAGTACTCTAAAGAACTCTTTAAAACAATTAACTGTTAGGATTGCAGT 
TATGATTGGATATTATTTAATTCTGTTTCTGATGTGGGGTTCCTCCACTGTGTTCTGTGTGC 
TATTAATATTTACCATTGCAGAAGCTTCATTCAGTGTTGAAAATGAATGCTTAGTGGATCTG 
TGCCTCTTACGCATATGTTACAAATTATCTGGAGTTCCTAATCAATGCAGAGTTCCCCTCCC 
CTCCGATTGTTCTAAATAATTGAAAGATGTCTGCTGTGGAAAAAGGCATGTATTTAAATCTG 
TATGATTCTCAACCATCTTTAGTTGGGAAAGGTCCTTGAAAGCCAATGGAAATACTTTTTTT 
TTTTCTTGGCACTAATCAAGTGAGTGTTACCTTTTCACTTAGTAGGATGTGTTGTTACGCTA 
GTAAAATAGAAACCTGTGTTTATTCTCAGGTATTTTAGAAACAACAGCCATCATTTTATTTT 
ATGTGTGTGTTCTTGGCTGTATTCATAAATTATATATTTTGGGCTATCAAATATTACTTCAT 
TCAATATAAATAACAATAGTAGAAGTTGTTTACTTAGATATGCTTTCTAGTTGCATTTTCTC 
AGCCTATGTAAGACTACTTTGTTGTAATAGCCTTTGAAATTTACAGTACTGTCTCTCTACTA 
TCTTCAGATTACTTGATTCAAATAAACCAATTATGTTTGTAATTGATATTAATAAAACCAGA 
ATAAAAGTTCATATCTACCC 
260/330
FIGURE 260 

MIGYYLILFLMWGSSTVFCVLLIFTIAEASFSVENECLVDLCLLRICYKLSGVPNQCRVPLP 
SDCSK 

Important features:
Signal peptide: 

amino acids 1-29 
261/330

FIGURE 261 

GAGGATTTGCCACAGCAGCGGATAGAGCAGGAGAGCACCACCGGAGCCCTTGAGACATCCTT 
GAGAAGAGCCACAGCATAAGAGACTGCCCTGCTTGGTGTTTTGCAGGATGATGGTGGCCCTT 
CGAGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGCAGCTTTTCTGCCCCCGCCGCAGTGTAC 
CCAGGACCCAGCCATGGTGCATTACATCTACCAGCGCTTTCGAGTCTTGGAGCAAGGGCTGG 
AAAAATGTACCCAAGCAACGAGGGCATACATTCAAGAATTCCAAGAGTTCTCAAAAAATATA 
TCTGTCATGCTGGGAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAACTT 
GGCACTGAGAGTTGAACGTGCCCAACGGGAGATTGACTACATACAATACCTTCGAGAGGCTG 
ACGAGTGCATCGTATCAGAGGACAAGACACTGGCAGAAATGTTGCTCCAAGAAGCTGAAGAA 
GAGAAAAAGATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATGGGCATAAAGTC 
TTTGAAAATAGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCTGTCTATA 
ACTCTCCAAAGGTGTACTTATTAATTGGATCCAGAAACAACACTGTTTGGGAATTTGCAAAC 
ATACGGGCATTCATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCTAACACTTTC 
CTGGCAGGGAACAGGCCAAGTGATCTACAAAGGTTTTCTATTTTTTCATAACCAAGCAACTT 
CTAATGAGATAATCAAATATAACCTGCAGAAGAGGACTGTGGAAGATCGAATGCTGCTCCCA 
GGAGGGGTAGGCCGAGCATTGGTTTACCAGCACTCCCCCTCAACTTACATTGACCTGGCTGT 
GGATGAGCATGGGCTCTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGGTTCTCA 
CAAAGATTGAGCCGGGCACACTGGGAGTGGAGCATTCATGGGATACCCCATGCAGAAGCCAG 
GATGCTGAAGCCTCATTCCTCTTGTGTGGGGTTCTCTATGTGGTCTACAGTACTGGGGGCCA
GGGCCCTCATCGCATCACCTGCATCTATGATCCACTGGGCACTATCAGTGAGGAGGACTTGC 
CCAACTTGTTCTTCCCCAAGAGACCAAGAAGTCACTCCATGATCCATTACAACCCCAGAGAT 
AAGCAGCTCTATGCCTGGAATGAAGGAAACCAGATCATTTACAAACTCCAGACAAAGAGAAA 
GCTGCCTCTGAAGTAATGCATTACAGCTGTGAGAAAGAGCACTGTGGCTTTGGCAGCTGTTC
TACAGGACAGTGAGGCTATAGCCCCTTCACAATATAGTATCCCTCTAATCACACACAGGAAG 
AGTGTGTAGAAGTGGAAATACGTATGCCTCCTTTCCCAAATGTCACTGCCTTAGGTATCTTC 
CAAGAGCTTAGATGAGAGCATATCATCAGGAAAGTTTCAACAATGTCCATTACTCCCCCAAA 
CCTCCTGGCTCTCAAGGATGACCACATTCTGATACAGCCTACTTCAAGCCTTTTGTTTTACT 
GCTCCCCAGCATTTACTGTAACTCTGCCATCTTCCCTCCCACAATTAGAGTTGTATGCCAGC 
CCCTAATATTCACCACTGGCTTTTCTCTCCCCTGGCCTTTGCTGAAGCTCTTCCCTCTTTTT 
CAAATGTCTATTGATATTCTCCCATTTTCACTGCCCAACTAAAATACTATTAATATTTCTTT 
CTTTTCTTTTCTTTTTTTTGAGACAAGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACTCC 
AGAGCTCAAGAGATCCTCCTGCCTCAGCCTCCTAAGTACCTGGGATTACAGGCATGTGCCAC 
CACACCTGGCTTAAAATACTATTTCTTATTGAGGTTTAACCTCTATTTCCCCTAGCCCTGTC 
CTTCCACTAAGCTTGGTAGATGTAATAATAAAGTGAAAATATTAACATTTGAATATCGCTTT 
CCAGGTGTGGAGTGTTTGCACATCATTGAATTCTCGTTTCACCTTTGTGAAACATGCACAAG 
TCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGTAACACAATTACAAAGTGAAAGATACAGC 
TAGAAAATACTACAAATCCCATAGTTTTTCCATTGCCCAAGGAAGCATCAAATACGTATGTT 
TGTTCACCTACTCTTATAGTCAATGCGTTCATCGTTTCAGCCTAAAAATAATAGTCTGTCCC 
TTTAGCCAGTTTTCATGTCTGCACAAGACCTTTCAATAGGCCTTTCAAATGATAATTCCTCC
AGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCCTCCTCTTGTCTTGCTGTCCTCTGT 
TTCTCTCTTTCTGCTTTAAATTCAATAAAAGTGACACTGAGCAAAAAAAAAAAAAAA 



WO 00/73454    PCT/US00/08439

262/330

FIGURE 262 

MMVALRGASALLVLFLAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQE 
FSKNISVMLGRCQTYTSEYKSAVGNLALRVERAQREIDYIQYLREADECIVSEDKTLAEMLL 
QEAEEEKKIRTLLNASCDNMLMGIKSLKIVKKMMDTHGSWMKDAVYNSPKVYLLIGSRNNTV 
WEFANIRAFMEDNTKPAPRKQILTLSWQGTGQVIYKGFLFFHNQATSNEIIKYNLQKRTVED 
RMLLPGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSWDT 
PCRSQDAEASFLLCGVLYVVYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIH 
YNPRDKQLYAWNEGNQIIYKLQTKRKLPLK 



263/330
FIGURE 263 

GGGCGCCCGCGTACTCACTAGCTGAGGTGGCAGTGGTTCCACCAACATGGAGCTCTCGCAGA 
TGTCGGAGCTCATGGGGCTGTCGGTGTTGCTTGGGCTGCTGGCCCTGATGGCGACGGCGGCG 
GTAGCGCGGGGGTGGCTGCGCGCGGGGGAGGAGAGGAGCGGCCGGCCCGCCTGCCAAAAAGC 
AAATGGATTTCCACCTGACAAATCTTCGGGATCCAAGAAGCAGAAACAATATCAGCGGATTC 
GGAAGGAGAAGCCTCAACAACACAACTTCACCCACCGCCTCCTGGCTGCAGCTCTGAAGAGC 
CACAGCGGGAACATATCTTGCATGGACTTTAGCAGCAATGGCAAATACCTGGCTACCTGTGC 
AGATGATCGCACCATCCGCATCTGGAGCACCAAGGACTTCCTGCAGCGAGAGCACCGCAGCA 
TGAGAGCCAACGTGGAGCTGGACCACGCCACCCTGGTGCGCTTCAGCCCTGACTGCAGAGCC 
TTCATCGTCTGGCTGGCCAACGGGGACACCCTCCGTGTCTTCAAGATGACCAAGCGGGAGGA 
TGGGGGCTACACCTTCACAGCCACCCCAGAGGACTTCCCTAAAAAGCACAAGGCGCCTGTCA 
TCGACATTGGCATTGCTAACACAGGGAAGTTTATCATGACTGCCTCCAGTGACACCACTGTC 
CTCATCTGGAGCCTGAAGGGTCAAGTGCTGTCTACCATCAACACCAACCAGATGAACAACAC 
ACACGCTGCTGTATCTCCCTGTGGCAGATTTGTAGCCTCGTGTGGCTTCACCCCAGATGTGA 
AGGTTTGGGAAGTCTGCTTTGGAAAGAAGGGGGAGTTCCAGGAGGTGGTGCGAGCCTTCGAA 
CTAAAGGGCCACTCCGCGGCTGTGCACTCGTTTGCTTTCTCCAACGACTCACGGAGGATGGC 
TTCTGTCTCCAAGGATGGTACATGGAAACTGTGGGACACAGATGTGGAATACAAGAAGAAGC 
AGGACCCCTACTTGCTGAAGACAGGCCGCTTTGAAGAGGCGGCGGGTGCCGCGCCGTGCCGC 
CTGGCCCTCTCCCCCAACGCCCAGGTCTTGGCCTTGGCCAGTGGCAGTAGTATTCATCTCTA 
CAATACCCGGCGGGGCGAGAAGGAGGAGTGCTTTGAGCGGGTCCATGGCGAGTGTATCGCCA 
ACTTGTCCTTTGACATCACTGGCCGCTTTCTGGCCTCCTGTGGGGACCGGGCGGTGCGGCTG 
TTTCACAACACTCCTGGCCACCGAGCCATGGTGGAGGAGATGCAGGGCCACCTGAAGCGGGC 
CTCCAACGAGAGCACCCGCCAGAGGCTGCAGCAGCAGCTGACCCAGGCCCAAGAGACCCTGA 
AGAGCCTGGGTGCCCTGAAGAAGTGACTCTGGGAGGGCCCGGCGCAGAGGATTGAGGAGGAG
GGATCTGGCCTCCTCATGGCACTGCTGCCATCTTTCCTCCCAGGTGGAAGCCTTTCAGTU^GG 
AGTCTCCTGGTTTTCTTACTGGTGGCCCTGCTTCTTCCCATTGAAACTACTCTTGTCTACTT 
AGGTCTCTCTCTTCTTGCTGGCTGTGACTCCTCCCTGACTAGTGGCCAAGGTGCTTTTCTTC 
CTCCCAGGCCCAGTGGGTGGAATCTGTCCCCACCTGGCACTGAGGAGAATGGTAGAGAGGAG 
AGGAGAGAGAGAGAGAATGTGATTTTTGGCCTTGTGGCAGCACATCCTCACACCCAAAGAAG 
TTTGTAAATGTTCCAGAACAACCTAGAGAACACCTGAGTACTAAGCAGCAGTTTTGCAAGGA 
TGGGAGACTGGGATAGCTTCCCATCACAGAACTGTGTTCCATCAAAAAGACACTAAGGGATT 
TCCTTCTGGGCCTCAGTTCTATTTGTAAGATGGAGAATAATCCTCTCTGTGAACTCCTTGCA 
AAGATGATATGAGGCTAAGAGAATATCAAGTCCCCAGGTCTGGAAGAAAAGTAGAAAAGAGT 
AGTACTATTGTCCAATGTCATGAAAGTGGTAAAAGTGGGAACCAGTGTGCTTTGAAACCAAA 
TTAGAAACACATTCCTTGGGAAGGCAAAGTTTTCTGGGACTTGATCATACATTTTATATGGT 
TGGGACTTCTCTCTTCGGGAGATGATATCTTGTTTAAGGAGACCTCTTTTCAGTTCATCAAG 
TTCATCAGATATTTGAGTGCCCACTCTGTGCCCAAATAAATATGAGCTGGGGATTAAAAAAA 
AAAAAT^Uy^AAAAAAAAAAAAAAAAAAAAAAAAAAAATUUUWUVAAAAA 
264/330
FIGURE 264 

MELSQMSELMGLSVLLGLLALMATAAVARGWLRAGEERSGRPACQKANGFPPDKSSGSKKQK 
QYQRIRKEKPQQHNFTHRLLAAALKSHSGNISCMDFSSNGKYLATCADDRTIRIWSTKDFLQ 
REHRSMRANVELDHATLVRFSPDCRAFIVWLANGDTLRVFKMTKREDGGYTFTATPEDFPKK 
HKAPVIDIGIANTGKFIMTASSDTTVLIWSLKGQVLSTINTNQMNNTHAAVSPCGRFVASCG 
FTPDVKVWEVCFGKKGEFQEVVRAFELKGHSAAVHSFAFSNDSRRMASVSKDGTWKLWDTDV 
EYKKKQDPYLLKTGRFEEAAGAAPCRLALSPNAQVLALASGSSIHLYNTRRGEKEECFERVH 
GECIANLSFDITGRFLASCGDRAVRLFHNTPGHRAMVEEMQGHLKRASNESTRQRLQQQLTQ 
AQETLKSLGALKK 

Important features:
Signal peptide: 

amino acids 1-25 

N-glycosylation site. 

amino acids 76-80, 92-96, 231-235, 289-293, 378-382, 421-425 
Beta-transducin family Trp-Asp repeat protein. 

amino acids 30-47, 105-118, 107-119, 203-216, 205-217, 296-308 
265/330
FIGURE 265 

TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCAGG 
CAGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACGCGAGTCTCAATCATGCTCC
TCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGT 
GGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCT 
GGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTCAGGAAACGCA 
AGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCGGACGGCAGGTAC 
CGCTGCTCCATGGACTTGAAGAACATCAATTTTTAGGCGCTTGCCTGGTCTCAGGATACCCA 
CCATCCTTTTCCTGAGCACAGCCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGAC 
TCTCCCAGTCCCTACACTGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAG 
GCAGACATACCTCCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGTCACAGCTTGAGG 
CTGTGGTGTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTA 
AATGGCAGAAAGGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTG 
CCCCTCTCCCCACATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTCTTGGGT 
GCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGAGGTCTGTGAGG 
ACCAATTTGTGGGTAGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGAC 
TCAAGATTGGCTCTTCCCAGAGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCA 
GGGAGGCCAATCAGCCCCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGA 
CCTGTGACCTTCTGCCAGAATTGTCATGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGT 
TAACCACTGAAGCCCCCAATTCCCACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAA 
TCTAATCTGATATTGACATATTAGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCA 
AGGATCAGCCCTGAGAGCAGGTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGG 
TGGGAGCAAGGGACAGGGAGCAGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGG 
CAACTACACACCAACATGCTGGCTTTAGAATAAAAGCACCAACTGAAAAAA 
266/330
FIGURE 266 

MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHP 
GSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 

Signal peptide: 

amino acids 1-19 

Tyrosine kinase phosphorylation site: 

amino acids 88-95 

N-myristoylation sites:

amino acids 33-39, 35-41, 46-52 
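
The feature coordinates listed for each polypeptide are 1-based and inclusive. As an illustrative sketch only (not part of the original disclosure), the following Python snippet shows how such a range can be mapped onto the Figure 266 sequence; the helper name `region` and the variable names are hypothetical.

```python
# Figure 266 polypeptide, copied from the listing above.
SEQ = (
    "MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHP"
    "GSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF"
)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive numbering."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}-{end} is outside 1-{len(seq)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


# Signal peptide as annotated in the listing: amino acids 1-19.
signal_peptide = region(SEQ, 1, 19)

# Everything after the annotated signal peptide (assumption: cleavage
# occurs immediately after residue 19; the listing does not state this).
remainder = SEQ[19:]

print(signal_peptide)
print(len(SEQ), len(remainder))
```

The same helper can be applied to any of the other annotated ranges, for example `region(SEQ, 88, 95)` for the listed tyrosine kinase phosphorylation site.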
267/330

FIGURE 267

AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAATGTCTTTC 
CTCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGGGGGC 
TGCTGCCTTGGCATTGCTGCTTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGAAAGCGG 
CCCTGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGACTTTCAAA 
GCAAAGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGCCAGGCTGTTT 
CCTCTGTCGAGAGGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCG 
TCCCCCTCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGGATTTCCAGCCTTAT 
TTCAAAGGAGAAATCTTCCTGGATGAAAAGAAAAAGTTCTATGGTCCACAAAGGCGGAAGAT 
GATGTTTATGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCTTCCGAGCCTGGAACGGAG 
GCTTCTCTGGAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCA 
GGAAAGCAGGGCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCTACT 
TTCTGTTCTGGAAGCTGCTAAGATGATCAAACCACAGACTTTGGCCTCAGAGAAAAAATGAT 
TGTGTGAAACTGCCCAGCTCAGGGATAACCAGGGACATTCACCTGTGTTCATGGGATGTATT 
GTTTCCACTCGTGTCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTA 
TTAATGTATTTTAATATTCTGTTTAGGCCCACTAAGGCAAAATAGCCCCAAAACAAGACTGA 
CAAAAATCTGAAAAACTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAA
TTGACTGCCAGGCTGGGTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGG 
TGAGCAAGTCACTTGAGGTCGGGAGTTCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTC 
TCTACTAAAAATACAAAAATCACCCGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCG 
GGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCA 
CACCACTGTATTCCAGCCTGGGTGACTGAGACTCTAACTAA 
268/330
FIGURE 268 

MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEKEPR 
TFKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDF
QPYFKGEIFLDEKKKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFV 
VGSGKQGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEKK 
269/330



FIGURE 269 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCG 
GGCCAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGA 
AGCCCCTTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTG
CTTCTGGCGCTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACA 
GACCACTTCTGCAAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATG 
GCAACCTGCGTCCGGAAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTG 
CTCCTGGCTGTGGGGCTGGCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGG 
CACCTACCGGCCCAGTAGCGAGGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGG 
ACTCCAAGGAGACGGTGCAGGGCTGCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCC
CTTCATTGCTGTGTGACCTTGGGGAAAGGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAG 
TGCTTAATAGCAGGGAAGAAGGTACTTCAAAGACTCTGCCCCTGAGGTCAAGAGAGGATGGG 
GCTATTCACTTTTATATATTTATATAAAATTAGTAGTGAGATGTAAAAAAAAAAAAAAAAAA 
270/330
FIGURE 270 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIV 
VFSLLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 
271/330
FIGURE 271 

AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGGTT 
TTGGGATTTTAATTTTCAAACACAGCAGAATGACATTTTTTCTGTCACTATTATTATTGTTG
GTATGTGAAGCTATTTGGAGATCCAATTCAGGAAGCAACACATTGGAGAATGGCTACTTTCT 
ATCAAGAAATAAAGAGAACCACAGTCAACCCACACAATCATCTTTAGAAGACAGTGTGACTC 
CTACCAAAGCTGTCAAAACCACAGGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGA 
GGGTTAATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACTTAGATTCAATG
ATTGTAAATTTAAGGCAAATACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATA 
TATACAATAAGGTGAAATTATAAGTACCCTATGCAGTTGGCTGGACAGTTCTAAATTGGACT 
TTATTAATTTTTAAAATCAGTAACTGATTTATCACTGGCTATGTGCTTAGATCTACAGGAGA 
TCATATAATTTGATACAAATAAAAGAAAAGTGTTCTCTCCCCTTACAGAATTGACATTTTAA 
ATGCGATACAGTTAGAATAGGAAATATGACATTAGAAAGGAAGAATGACAGGGAGAAAGGAA 
AGAAGGGAAAATGTTGCCAAGGAAAAAAAAA 
272/330
FIGURE 272 

MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGK 
GIVKGRNLDSRGLILGAEAWGRGVKKNT 
273/330



FIGURE 273 



GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCT
GTGCCTGCTGCACCAGTCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATA 
TTGTCATTGTTATAGATCCTAGTGTGCCAGAAGATGAAAAAATAATTGAACAAATAGAGGAT 
ATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGATTTTTTTTCAAAAA 
TGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATG 
AAAACCATAAACATGCTGATGTTATAGTTGCACCACCTACACTCCCAGGTAGAGATGAACCA 
TACACCAAGCAGTTCACAGAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCT 
TCTACTTGGAAAAAAACAAAATGAATATGGACCACCAGGCAAACTGTTTGTCCATGAGTGGG 
CTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTCTACCGTGCTAAG 
TCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAA
GTGTCAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATG 
GAAAAGATTGTCAATTCTTTCCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATG 
CAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAAAAAACCCATAATCAAGAAGCTCCAAG 
CCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAATTCTGAGGATT 
TTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATC
AGTCAAAGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCT 
AAATCGAATGAATCAAGCAGCAAAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGG 
TGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTAAATAAGCTAATCCAAATAAAAAGC 
AGTGATGAAAGAAACACACTCATGGCAGGATTACCTACATATCCTCTGGGAGGAACTTCCAT 
CTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGAT 
CCGAAGTACTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTG 
AAACAAAGTGGGGCCATTGTTCATTTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAAT 
AGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTTTCAGATGAAGCTCAGAACAATG 
GCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAGTCCCTT 
CAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAAT 
TGATAGTACAGTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTA 
TTTCTCTCTGGGATCCCAGTGGAACAATAATGGAAAATTTCACAGTGGATGCAACTTCCAAA 
ATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACTTGGGCATACAATCTTCAAGC 
CAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCTGTGC 
CTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATT 
GTTTACGCAGAAATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCAT 
TGAATCACAGAATGGACATACAGAAGTTTTGGAACTTTTGGATAATGGTGCAGGCGCTGATT 
CTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATATACAGAAAATGGCAGATAT 
AGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCACT 
GAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAA
GACCTGAAATTGATGAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGA 
GGTGCATTTGTGGTATCACAAGTCCCAAGCCTTCCCTTGCCTGACCAATACCCACCAAGTCA
AATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTTACATGGACAGCACCAG 
GAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGA
GGCCAACTCCAAGGAAAGCTTTGCATTTAAACCAGAAAATATCTCAGAAGAAAATGCAACCC 
ACATATTTATTGCCATTAAAAGTATAGATAAAAGCAATTTGACATCAAAAGTATCCAACATT 
GCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGATCCTACACCTACTCC
TACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTAT 
TGTCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATTTGAACCTTA 
ACGAAGAAAAAAATCTTCAAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGT 
AAAGGATATTTCTGAATCTTAAAATTCATCCCATGTGTGATCATAAACTCATAAAAATAATT
TTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATATGTAAAAACTGT 
CAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAAC
AAAGATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAAT 
GATATTTCAAATTGCATCAAGAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAA 
GGAGAGCAAATT^CAACATTTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
274/330



FIGURE 274 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTY 
LFEATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTEC 
GEKGEYIHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATR 
CSAGISGRNRVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSVVE
FCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLV 
LDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLM 
AGLPTYPLGGTSICSGIKYAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVH 
FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 
LNSNAWMNDTVIIDSTVGKDTFFLITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPG 
TAKVGTWAYNLQAKANPETLTITVTSRAANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQG 
YVPVLGANVTAFIESQNGHTEVLELLDNGAGADSFKNDGVYSRYFTAYTENGRYSLKVRAHG 
GANTARLKLRPPLNRAAYIPGWVVNGEIEANPPRPEIDEDTQTTLEDFSRTASGGAFVVSQV
PSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDNFDVGKVQRYIIRISASILDLRDSFDD 
ALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAIKSIDKSNLTSKVSNIAQVTLFIP 
QANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSVVIVNFILSTTI

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 



N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
275/330



FIGURE 275 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCG 
GGCAGGGGTGACAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAG 
ACGCCAAGGTAATTTTGACCCAGAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACC 
CCCAGTTATGCCAGGATTTACTAGAGAGTGTCAACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTT 
GTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCAGTGCTCAGAAGTGACTTGCCTGA 
GGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAAGGCTGTGATGGG 
AATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAAA 
GTAGAGAAGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATG 
CAAGCAGCTCCGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGG 
GCCCCGGCTTTGAGGGATGCCACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCC 
GGGGGCTGCTTGCGTGGATTTCCCGGGTGGTGGTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGT 
CCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAGCTGGCACTGCCCAGGGCCAACAGC 
CCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCACCGCAACTACGTGA 
GCAGCCTGAAGCGGCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGGAA 
TGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAG 
GCCGACCTCCTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGG 
CCACAGAGTATGCAGCAGTGCCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGG 
CCTTACCCGCCACCCCGAGGAGAAGCCTGTGAGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATT 
GAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGCCCCAATCACCGTCCTTACACGGCCT 
CTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACATTGTATGAGCTCACCTTCAA 
AGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAAGTG 
AAAAATGAAAAGCTCAACATGGCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGG 
ACAAGTTCCGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCT 
CACTGTTGTTTACTTTGGGAAAGAAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAA 
GCTGCCAACTTCAGGAACTTTACCTTCATCCAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTG 
ATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTCTTTTTCTGTGATGTGGACATCTACTT 
CACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAGGTATTTTATCCAGTT 
CTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAACAGC 
AGCTGGTCATAAAGAAGGAAACTGGATTTTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCG 
GTCAGACTTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCAC 
CTTTATCGCAAGTATCTCCACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACC 
TCTGGCATGAGAAGCGCTGCATGGACGAGCTGACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAA 
GGCCATGAACGAGGCATCCCACGGCCAGCTGGGCATGCTGGTGTTCAGGCACGAGATAGAGGCTCAC 
CTTCGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGAGAAGGATTGTGGGAGA 
CACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAAGGACG 
ACAAAAGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTAC 
AACAGAAATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACA 
AAGGCAGAATGCTTGTGAGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACT 
GAGACCTGTTGTTTTGTGTGCTCATTGAAATATTCATGATTTAAGAGCAGTTTTGTAAAAAATTCAT 
TAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGAGCCTATCAGCAGGGCTCTAGTTTCTAGG 
AATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACTAGTGAGTACATTAAGTA 
AAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTTCCCCAAGATTAACCA 
AAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAAT 
GCACTTTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGA 
GCACAAGTTGGCCTACATTTTTATATTTTTTAAGAAGATACTTTGAGATGCATTATGAGAACTTTCA 
GTTCAAAGCATCAAATTGATGCCATATCCAAGGACATGCCAAATGCTGATTCTGTCAGGCACTGAAT 
GTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACTAATACAGACGTACAGATACTTTCTCTGAA 
GAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGACACTTTCTGCTTTACAGAA 
AAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAGAAACCACATTTTCTCCTCA 
GAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCT 
TTTCAAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATGGAGAAAAATATATATAT 
ATATATATATATTGTGAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACAT 
GTTATCCACCCCAGGCCAGGTGGAAGTAACTGAATTATTTTTT/y^TTAAGCAGTTCTACTCAATCA 
CCAAGATGCTTCTGAAAATTGCATTTTATTACCATTTCAAACTATTTTTTAAAAATAAATACAGTTA 
ACATAGAGTGGTTTCTTCATTCATGTGAAAATTATTAGCCAGCACCAGATGCATGAGCTAATTATCT 
CTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAACATTCAAGC 
TGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACAC 
AGGCCATGAATGGAAGGTGGTATTGCACAGCTAATAAAATATGATTTGTGGATATGAA 
276/330
FIGURE 276 

MMMVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQ
EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFL 
HSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIES
ALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPI 
MKVKNEKLNMANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVK
GILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCR 
LNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFI 
NIGGFDLDIKGWGGEDVHLYRKYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS
KAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSKKT 
277/330
FIGURE 277 

GAAAGAATGTTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCTGAACTCTGTCAACC
AGGTGCAGAAAATGCTTTTAAAGTGAGACTTAGTATCAGAACAGCTCTGGGAGATAAAGCAT 
ATGCCTGGGATACCAATGAAGAATACCTCTTCAAAGCGATGGTAGCTTTCTCCATGAGAAAA 
GTTCCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTTTGCAATGTAACCCAGAGGGT 
ATCATTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCACACCCTTCCTGCTGTTGAGGTGC 
AATCAGCCATAAGAATGAACAAGAACCGGATCAACAATGCCTTCTTTCTAAATGACCAAACT 
CTGGAATTTTTAAAAATCCCTTCCACACTTGCACCACCCATGGACCCATCTGTGCCCATCTG 
GATTATTATATTTGGTGTGATATTTTGCATCATCATAGTTGCAATTGCACTACTGATTTTAT 
CAGGGATCTGGCAACGTAGAAGAAAGAACAAAGAACCATCTGAAGTGGATGACGCTGAAGAT 
AAGTGTGAAAACATGATCACAATTGAAAATGGCATCCCCTCTGATCCCCTGGACATGAAGGG 
GGGCATATTAATGATGCCTTCATGACAGAGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGT
TGTTCTGCTTCCTCAAGAAATTAAACATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATA 
CCAAGAGCAGATCATATATTTTGTTTCACCATTCTTCTTTTGTAATAAATTTTGAATGTGCT 
TGAAAGTGAAAAGCAATCAATTATACCCACCAACACCACTGAAATCATAAGCTATTCACGAC 
TCAAAATATTCTAAAATATTTTTCTGACAGTATAGTGTATAAATGTGGTCATGTGGTATTTG 
TAGTTATTGATTTAAGCATTTTTAGAAATAAGATCAGGCATATGTATATATTTTCACACTTC 
AAAGACCTAAGGAAAAATAAATTTTCCAGTGGAGAATACATATAATATGGTGTAGAAATCAT 
TGAAAATGGATCCTTTTTGACGATCACTTATATCACTCTGTATATGACTAAGTAAACAAAAG
TGAGAAGTAATTATTGTAAATGGATGGATAAAAATGGAATTACTCATATACAGGGTGGAATT 
TTATCCTGTTATCACACCAACAGTTGATTATATATTTTCTGAATATCAGCCCCTAATAGGAC 
AATTCTATTTGTTGACCATTTCTACAATTTGTAAAAGTCCAATCTGTGCTAACTTAATAAAG 
TAATAATCATCTCTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAA
278/330
FIGURE 278 

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRTALGDKAYAWDTNEEYLFKAMVAFSMRKVP 
NREATEISHVLLCNVTQRVSFWFVVTDPSKNHTLPAVEVQSAIRMNKNRINNAFFLNDQTLE
FLKIPSTLAPPMDPSVPIWIIIFGVIFCIIIVAIALLILSGIWQRRRKNKEPSEVDDAEDKC 
ENMITIENGIPSDPLDMKGGILMMPS 
279/330
FIGURE 279



AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGG
GTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTCATGT 
ATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCTCTTTGG 
CCTATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGGACAGATGC 
TCGGTTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTGGA 
ATTTTCGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTC 
CAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGA 
TGCCTCCATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACATACACCTGCCAGGTGA 
AGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTCAGCGTCGTGCACACTGTA 
CGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGATCAT 
AATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTC 
ATAAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGGCTCAACCAAGAGAAAAAGGTCTCT 
GTTTATTTAGAAGACACAGACTAACAATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAA
GAACCCTAGTATTTCTTGAAGTTAATGGAAACTTTTCTTTGGCTTTTCCAGTTGTGACCCGT 
TTTCCAACCAGTTCTGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGCCAGCAC 
AGTGCTCCTCCATATCACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGA 
GTGTAAATTTTTTCAAGTGCTCATTAGGTTTTATAAACAAGAAGCTACATTTTTGCCCTTAA 
GACACTACTTACAGTGTTATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCC 
AATTTGTCTGTTACATTTCCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTA 
ATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTG 
TTTCTGATTAACAGTAAATCCTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTC 
TCCTTAACTATGAGACACATCTTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATT 
TTTGTCG 
280/330
FIGURE 280 

MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALTVT 
WNFRPLDGGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQ 
VKNPPDVDGVIGEIRLSVVHTVRFSEIHFLALAIGSACALMIIIVIVVVLFQHYRKKRWAER
AHKVVEIKSKEEERLNQEKKVSVYLEDTD
281/330
FIGURE 281 

GCATTTTTGTCTGTGCTCCCTGATCTTCAGGTCACCACCATGAAGTTCTTAGCAGTCCTGGT 
ACTCTTGGGAGTTTCCATCTTTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTCCAGCTG 
ACACGTATCCAGCTACTGGTCCTGCTGATGATGAAGCCCCTGATGCTGAAACCACTGCTGCT 
GCAACCACTGCGACCACTGCTGCTCCTACCACTGCAACCACCGCTGCTTCTACCACTGCTCG 
TAAAGACATTCCAGTTTTACCCAAATGGGTTGGGGATCTCCCGAATGGTAGAGTGTGTCCCT 
GAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACTATTCATGCTTCCTGTGATTTC
ATCCAACTACTTACCTTGCCTACGATATCCCCTTTATCTCTAATCAGTTTATTTTCTTTCAA 
ATAAAAAATAACTATGAGCAACATAAAAAAAAAAAAA 
282/330
FIGURE 282 

MKFLAVLVLLGVSIFLVSAQNPTTAAPADTYPATGPADDEAPDAETTAAATTATTAAPTTAT 
TAASTTARKDIPVLPKWVGDLPNGRVCP 
283/330
FIGURE 283 

GGACTCTGAAGGTCCCAAGCAGCTGCTGAGGCCCCCAAGGAAGTGGTTCCAACCTTGGACCC 
CTAGGGGTCTGGATTTGCTGGTTAACAAGATAACCTGAGGGCAGGACCCCATAGGGGAATGC 
TACCTCCTGCCCTTCCACCTGCCCTGGTGTTCACGGTGGCCTGGTCCCTCCTTGCCGAGAGA 
GTGTCCTGGGTCAGGGACGCAGAGGACGCTCACAGACTCCAGCCCTTTGTTACCGAGAGGAC 
ACTTGGCAAGGTCCAGCGATGGTCCGGAGTCCACACACAGACTGGCGGCAGGGCAGGAGGGG 
GACAGTTCTGTTGTGCTTGGTTGGACAGTAAGAGGGTCTTGGCCAGTCCAGGGTGGGGGGCG 
GCAAACTCCATAAAGAACCAGAGGGTCTGGGCCCCGGCCACAGAGTCATCTGCCCAGCTCCT 
CTGCTGCTGGCCAGTGGGAGTGGCACGAGGTGGGGCTTTGTGCCAGTAAAACCACAGGCTGG 
ATTTGCCTGCGGGCCATGGTCCCTGTCTAGGGCAGCAATTCTCAACCTTCTTGCTCTCAGGA 
CCCCAAAGAGCTTTCATTGTATCTATTGATTTTTACCACATTAGCAATTAAAACTGAGAAAT 
GGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGAT 
CACCTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTACTAAAAA 
TACAAAAAATTAGCCAGGCACAGTGGTGTGCACTGGTAGTCCCAGTTACTCGGGAGGCTGAG 
GCAGGAAAATCGCTTGAACCCAGGAGGCGGACGTTGCGGTGAGCCGAGATCGCGCCGCTGAT 
TCCAGCCTGGGCGACAAGAGTGAGACTCCATCTCACACA 
FIGURE 284 

MLPPALPPALVFTVAWSLLAERVSWVRDAEDAHRLQPFVTERTLGKVQRWSGVHTQTGGRAG 
GGQFCCAWLDSKRVLASPGWGAANSIKNQRVWAPATESSAQLLCCWPVGVARGGALCQ 
FIGURE 285 

GTCATGCCAGTGCCTGCTCTGTGCCTGCTCTGGGCCCTGGCAATGGTGACCCGGCCTGCCTCA 
GCGGCCCCCATGGGCGGCCCAGAACTGGCACAGCATGAGGAGCTGACCCTGCTCTTCCATGG 
GACCCTGCAGCTGGGCCAGGCCCTCAACGGTGTGTACAGGACCACGGAGGGACGGCTGACAA 
AGGCCAGGAACAGCCTGGGTCTCTATGGCCGCACAATAGAACTCCTGGGGCAGGAGGTCAGC 
CGGGGCCGGGATGCAGCCCAGGAACTTCGGGCAAGCCTGTTGGAGACTCAGATGGAGGAGGA 
TATTCTGCAGCTGCAGGCAGAGGCCACAGCTGAGGTGCTGGGGGAGGTGGCCCAGGCACAGA 
AGGTGCTACGGGACAGCGTGCAGCGGCTAGAAGTCCAGCTGAGGAGCGCCTGGCTGGGCCCT 
GCCTACCGAGAATTTGAGGTCTTAAAGGCTCACGCTGACAAGCAGAGCCACATCCTATGGGC 
CCTCACAGGCCACGTGCAGCGGCAGAGGCGGGAGATGGTGGCACAGCAGCATCGGCTGCGAC 
AGATCCAGGAGAGACTCCACACAGCGGCGCTCCCAGCCTGAATCTGCCTGGATGGAACTGAG 
GACCAATCATGCTGCAAGGAACACTTCCACGCCCCGTGAGGCCCCTGTGCAGGGAGGAGCTG 
CCTGTTCACTGGGATCAGCCAGGGCGCCGGGCCCCACTTCTGAGCACAGAGCAGAGACAGAC 
GCAGGCGGGGACAAAGGCAGAGGATGTAGCCCCATTGGGGAGGGGTGGAGGAAGGACATGTA 
CCCTTTCATGCCTACACACCCCTCATTAAAGCAGAGTCGTGGCATTTCAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 286 

MPVPALCLLWALAMVTRPASAAPMGGPELAQHEELTLLFHGTLQLGQALNGVYRTTEGRLTK 
ARNSLGLYGRTIELLGQEVSRGRDAAQELRASLLETQMEEDILQLQAEATAEVLGEVAQAQK 
VLRDSVQRLEVQLRSAWLGPAYREFEVLKAHADKQSHILWALTGHVQRQRREMVAQQHRLRQ 
IQERLHTAALPA 
FIGURE 287 

GGCAACATGGCTCAGCAGGCTTGCCCCAGAGCCATGGCAAAGAATGGACTTGTAATTTGCAT 
CCTGGTGATCACCTTACTCCTGGACCAGACCACCAGCCACACATCCAGATTAAAAGCCAGGA 
AGCACAGCAAACGTCGAGTGAGAGACAAGGATGGAGATCTGAAGACTCAAATTGAAAAGCTC 
TGGACAGAAGTCAATGCCTTGAAGGAAATTCAAGCCCTGCAGACAGTCTGTCTCCGAGGCAC 
TAAAGTTCACAAGAAATGCTACCTTGCTTCAGAAGGTTTGAAGCATTTCCATGAGGCCAATG 
AAGACTGCATTTCCAAAGGAGGAATCCTGGTTATCCCCAGGAACTCCGACGAAATCAACGCC 
CTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGACTTTTGGCTGGGCATCAATGA 
CATGGTCACGGAAGGCAAGTTTGTTGACGTCAACGGAATCGCTATCTCCTTCCTCAACTGGG 
ACCGTGCACAGCCTAACGGTGGCAAGCGAGAAAACTGTGTCCTGTTCTCCCAATCAGCTCAG 
GGCAAGTGGAGTGATGAGGCCTGTCGCAGCAGCAAGAGATACATATGCGAGTTCACCATCCC 
TAAATAGGTCTTTCTCCAATGTGTCCTCCAAGCAAGATTCATCATAACTTATAGGTTCATGA
TCTCTAAGATCAAGTAAAAATCATAATTTTTACTTATTAAAAAATTGCAACACAAGATCAAT 
GTCCATAGCAATATGATAGCATCAGCCAATTTTGCTMCACATTTCTTTGGGATTTTGCCCT 
TCCTGGGGTATAGGGGATCAGAAATATTGATCCATGTGCACGCAGATAAAATGGCTTCTGCT 
AAACAGACTAAAATCTTTCTCTCTAGTCTTTCTCACTTGTACAAACCCAGTTTGTTTTCAAA 
AAATCACAGTAGCAATGCAACTCATCACTCTAGAAAAGCAAGCTTAGGCTACCTGAAAGATT 
TTCCCTTGGAAGTTTAGCGTATGTTTGACTAACAAAAATTCCCTACATCAGAGACTCTAGGT 
GCTATATAATCCAAAAACTTTTCAGCCTGTTGCTCATTCTGTCCCATGCTGGCAATAATACC 
TTGTCAGCCCATTACCCTTATTTTGAATTGCTCCATCTCCTGGTGGGACTTGTATCTTGTCT 
GCCATATCAGAACACAAACCCCTGAAGAGGTTCTGATTTGATTTTTTTTTTTTCTTCATGCC 
TACCCTTTTTTTGGAAGTTTCCAGCCGCAATTTGAAATGAAATGACAAGGTGTATATTTGAT 
CAATTTTCATTCCCACCATTGCATTACAACCTCTAACTTAAATGGGTAACCCTAAGGCATAT 
CAAAGAAGCAGATTGCATGATAAACGGAAATAGAAAAAAAGAACCTACATTTATTTTGCTTT 
AGCATCCTTACTCTCACCTTTTATGAGATTGAGAGTGGACTTACATTTCCTTTTTTACATTT 
TCGTATATTTATTTTTTTTAGCCATCATTATATGTTTAAGTCTATTATGGGCAACCAATCTT 
TGGAAGCTGAAAACTGAATTTAAAGAATGCTATCTTGGAAAATTGCATACGTCTGTGCAATT 
TTTTATTCTGCCTAGTGCTATTCTGCTTGTTTAACTAGATTGTACAAAATAACTTCATTGCT 
TAATATCAAATTACAAAGTTTAGACTTGGAGGGAAATGGGCTTTTTAGAAGCAAACAATTTT 
AAATATATTTTGTTCTTCAAATAAATAGTGTTTAAACATTGAATGTGTTTTGTGAACAATAT 
CCCACTTTGCAAACTTTAACTACACATGCTTGGAATTAAGTTTTAGCTGTTTTCATTGCTCA 
AT AAT AAAGCCT G AAT T CT GATC AATAAAAAAAAAAAAAAAAAAAAAAA7VAAAAA 
FIGURE 288 

MAQQACPRAMAKNGLVICILVITLLLDQTTSHTSRLKARKHSKRRVRDKDGDLKTQIEKLWT 
EVNALKEIQALQTVCLRGTKVHKKCYLASEGLKHFHEANEDCISKGGILVIPRNSDEINALQ 
DYGKRSLPGVNDFWLGINDMVTEGKFVDVNGIAISFLNWDRAQPNGGKRENCVLFSQSAQGK 
WSDEACRSSKRYICEFTIPK
FIGURE 289 

GCGAGGACCGGGTATAAGAAGCCTCGTGGCCTTGCCCGGGCAGCCGCAGGTTCCCCGCGCGC 
CCCGAGCCCCCGCGCCATGAAGCTCGCCGCCCTCCTGGGGCTCTGCGTGGCCCTGTCCTGCA
GCTCCGCTGCTGCTTTCTTAGTGGGCTCGGCCAAGCCTGTGGCCCAGCCTGTCGCTGCGCTG 
GAGTCGGCGGCGGAGGCCGGGGCCGGGACCCTGGCCAACCCCCTCGGCACCCTCAACCCGCT 
GAAGCTCCTGCTGAGCAGCCTGGGCATCCCCGTGAACCACCTCATAGAGGGCTCCCAGAAGT 
GTGTGGCTGAGCTGGGTCCCCAGGCCGTGGGGGCCGTGAAGGCCCTGAAGGCCCTGCTGGGG 
GCCCTGACAGTGTTTGGCTGAGCCGAGACTGGAGCATCTACACCTGAGGACAAGACGCTGCC
CACCCGCGAGGGCTGAAAACCCCGCCGCGGGGAGGACCGTCCATCCCCTTCCCCCGGCCCCT 
CTCAATAAACGTGGTTAAGAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAA 
FIGURE 290 

MKLAALLGLCVALSCSSAAAFLVGSAKPVAQPVAALESAAEAGAGTLANPLGTLNPLKLLLS 
SLGIPVNHLIEGSQKCVAELGPQAVGAVKALKALLGALTVFG 
FIGURE 291 

TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCACT 
CCTTGGCCTCCGCAGCCGATCACATGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTG 
GCACAGGTGTGGCTGGTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCC 
TCAGAACCAGACCAGCAGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGG 
CCAGCGAGGAGAAGGCCGGTGAGGAAGAGAAAGCCTGGCTGATGGCCAGCAGGCAGCAGCTT 
GCCAAGGAGACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGG 
CAACATGGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCA
CAGGGCCGACTGAAACCCAGATCAAGAGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAG 
CCCGGGCTCCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACT 
GGGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTCT 
TCAATTTATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCCTCA 
CAGGCCAAAAGGCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCCAAACT 
GTTTGATGAGATTAATCCTGAAACCAAATTAATTCTTGTGGATTACATCTTGTTCAAAGGGA 
AATGGTTGACCCCATTTGACCCTGTCTTCACCGAAGTCGACACTTTCCACCTGGACAAGTAC 
AAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCTTTGACAAGAA 
TTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCA 
TGGAGAAAATGGGTGACCACCTCGCCCTTGAAGACTACCTGACCACAGACTTGGTGGAGACA 
TGGCTCAGAAACATGAAAACCAGAAACATGGAAGTTTTCTTTCCGAAGTTCAAGCTAGATCA 
GAAGTATGAGATGCATGAGCTGCTTAGGCAGATGGGAATCAGAAGAATCTTCTCACCCTTTG 
CTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGA 
ACAGTGATTGAAGTTGATGAAAGGGGCACTGAGGCAGTGGCAGGAATCTTGTCAGAAATTAC 
TGCTTATTCCATGCCTCCTGTCATCAAAGTGGACCGGCCATTTCATTTCATGATCTATGAAG 
AAACCTCTGGAATGCTTCTGTTTCTGGGCAGGGTGGTGAATCCGACTCTCCTATAATTCAGG 
ACATGCATAAGCACTTCGTGCTGTAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGA 
TACCAGCAATGGATGGCAGGGGAGAGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTGTTCTC 
AAATAAATACAGTAGTCCCCACTTATCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGC 
CTGAAACGGTGGACAGTGCTGAACCTTATATATATTTTTTCCTACACATACATACCTATGAT 
AAAGTTTAATTTATAAATTAGGCACAGTAAGAGATTAACAATAATAACAACATTAAGTAAAA 
TGAGTTACTTGAACGCAAGCACTGCAATACCATAACAGTCAAACTGATTATAGAGAAGGCTA 
CTAAGTGACTCATGGGCGAGGAGCATAGACAGTGTGGAGACATTGGGCAAGGGGAGAATTCA 
CATCCTGGGTGGGACAGAGCAGGACGATGCAAGATTCCATCCCACTACTCAGAATGGCATGC 
TGCTTAAGACTTTTAGATTGTTTATTTCTGGAATTTTTCATTTAATGTTTTTGGACCATGGT 
TGACCATGGTTAACTGAGACTGCAGAAAGCAAAACCATGGATAAGGGAGGACTACTACAAAA 
GCATTAAATTGATACATATTTTTTAAAAAAAAAAAAAAAAAAA 
FIGURE 292 

MKVVPSLLLSVLLAQVWLVPGLAPSPQSPETPAPQNQTSRVVQAPREEEEDEQEASEEKAGE
EEKAWLMASRQQLAKETSNFGFSLLRKISMRHDGNMVFSPFGMSLAMTGLMLGATGPTETQI
KRGLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYF 
DTECVPMNFRNASQAKRLMNHYINKETRGKIPKLFDEINPETKLILVDYILFKGKWLTPFDP 
VFTEVDTFHLDKYKTIKVPMMYGAGKFASTFDKNFRCHVLKLPYQGNATMLVVLMEKMGDHL
ALEDYLTTDLVETWLRNMKTRNMEVFFPKFKLDQKYEMHELLRQMGIRRIFSPFADLSELSA 
TGRNLQVSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPFHFMIYEETSGMLLF 
LGRVVNPTLL
FIGURE 293 

CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTACAGAGACGCGGACCCCAGACATGAG 
GAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTCCCAG 
CACCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGACCCAGAGAAG 
GCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTGCTGTTCCC 
TGTCCAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGCAGGGGCCCCA 
TCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGTGTCCTGAGTCCC
GAGCCCGACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGACCAGGGCGAGGAGAGGCC 
CCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAGGAAGACCAAGACCACA 
TCTACCACCCCCAGTAGGGCTCCAGGGGCCATCACTGCCCCCGCCCTGTCCCAAGGCCCAGG
CTGTTGGGACTGGGACCCTCCCTACCCTGCCCCAGCTAGACAAATAAACCCCAGCAGGCAAA 
AAAAAAAAAAAAAAAA 
FIGURE 294 

MRRLLLVTSLVVVLLWEAGAVPAPKVPIKMQVKHWPSEQDPEKAWGARVVEPPEKDDQLVVL
FPVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQGEE 
RPRLWVMPNHQVLLGPEEDQDHIYHPQ 

FIGURE 295

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTG 
TACCCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGC 
TGTTTCTCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAA 
TGGACCTGTTCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCC 
TAGTGCATTTGATGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCT 
GTGACATGACCTCTGGGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATG 
CGTGGGAAGTGCACGGTGGGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCC 
AGAGGGGGACGGCAACTGGGCCAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCG 
ATGACTACAAGAACCCTGGCTACTACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTG 
CCCAATAAGTCCCCCATGCAGCACTGGAGAAACAGCTCCCTGCTGAGGTACCGCACGGACAC 
TGGCTTCCTCCAGACACTGGGACATAATCTGTTTGGCATCTACCAGAAATATCCAGTGAAAT 
ATGGAGAAGGAAAGTGTTGGACTGACAACGGCCCGGTGATCCCTGTGGTCTATGATTTTGGC 
GACGCCCAGAAAACAGCATCTTATTACTCACCCTATGGCCAGCGGGAATTCACTGCGGGATT 
TGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAACGCCTTGTGTGCTGGAATGAGGG 
TCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAGGATACTTTCCAGAGGCCAGT 
CCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATATGGAACTCATGTTGGTTA 
CAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCGTTGAGAGTTTTGTG 
GGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAACAACTTACCCA 
GTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGAAAAAAA 
FIGURE 296 

MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTEN 
GVIYQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFG 
SAEAATSDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGI 
YQKYPVKYGEGKCWTDNGPVIPVVYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAAN
ALCAGMRVTGCNTEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLL 
FYR 
FIGURE 297 

GCGGAGCCGGCGCCGGCTGCGCAGAGGAGCCGCTCTCGCCGCCGCCACCTCGGCTGGGAGCC 
CACGAGGCTGCCGCATCCTGCCCTCGGAACAATGGGACTCGGCGCGCGAGGTGCTTGGGCCG 
CGCTGCTCCTGGGGACGCTGCAGGTGCTAGCGCTGCTGGGGGCCGCCCATGAAAGCGCAGCC 
ATGGCGGCATCTGCAAACATAGAGAATTCTGGGCTTCCACACAACTCCAGTGCTAACTCAAC 
AGAGACTCTCCAACATGTGCCTTCTGACCATACAAATGAAACTTCCAACAGTACTGTGAAAC 
CACCAACTTCAGTTGCCTCAGACTCCAGTAATACAACGGTCACCACCATGAAACCTACAGCG 
GCATCTAATACAACAACACCAGGGATGGTCTCAACAAATATGACTTCTACCACCTTAAAGTC 
TACACCCAAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCCACAATGACCG 
TAACCCACAATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGCAT 
TCTGAAGCAAAGAAAGGATCAAAATTTGATACTGGGAGCTTTGTTGGTGGTATTGTATTAAC 
GCTGGGAGTTTTATCTATTCTTTACATTGGATGCAAAATGTATTACTCAAGAAGAGGCATTC 
GGTATCGAACCATAGATGAACATGATGCCATCATTTAAGGAAATCCATGGACCAAGGATGGA 
ATACAGATTGATGCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCT 
CTTTTTGAAAATAGTATAAACAGGCCATGCATATAATGTACAGTGTATTACGTAAATATGTA 
AAGATTCTTCAAGGTAACAAGGGTTTGGGTTTTGAAATAAACATCTGGATCTTATAGACCGT 
TCATACAATGGTTTTAGCAAGTTCATAGTAAGACAAACAAGTCCTATCTTTTTTTTTTGGCT 
GGGGTGGGGGCATTGGTCACATATGACCAGTAATTGAAAGACGTCATCACTGAAAGACAGM 
TGCCATCTGGGCATACAAATAAGAAGTTTGTCACAGCACTCAGGATTTTGGGTATCTTTTGT 
AGCTCACATAAAGAACTTCAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCACA 
CAGAAATTATACAATCAAACTAGATCTGAAGCATAATTTAAGAAAAACATCAACATTTTTTG 
TGCTTTAAACTGTAGTAGTTGGTCTAGAAACAAAATACTCC 
FIGURE 298 

MGLGARGAWAALLLGTLQVLALLGAAHESAAMAASANIENSGLPHNSSANSTETLQHVPSDH 
TNETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPGMVSTNMTSTTLKSTPKTTSVSQN 
TSQISTSTMTVTHNSSVTSAASSVTITTTMHSEAKKGSKFDTGSFVGGIVLTLGVLSILYIG 
CKMYYSRRGIRYRTIDEHDAII

FIGURE 299 

CAGCCGGGTCCCAAGCCTGTGCCTGAGCCTGAGCCTGAGCCTGAGCCCGAGCCGGGAGCCGG 
TCGCGGGGGCTCCGGGCTGTGGGACCGCTGGGCCCCCAGCGATGGCGACCCTGTGGGGAGGC 
CTTCTTCGGCTTGGCTCCTTGCTCAGCCTGTCGTGCCTGGCGCTTTCCGTGCTGCTGCTGGC 
GCAGCTGTCAGACGCCGCCAAGAATTTCGAGGATGTCAGATGTAAATGTATCTGCCCTCCCT 
ATAAAGAAAATTCTGGGCATATTTATAATAAGAACATATCTCAGAAAGATTGTGATTGCCTT 
CATGTTGTGGAGCCCATGCCTGTGCGGGGGCCTGATGTAGAAGCATACTGTCTACGCTGTGA 
ATGCAAATATGAAGAAAGAAGCTCTGTCACAATCAAGGTTACCATTATAATTTATCTCTCCA 
TTTTGGGCCTTCTACTTCTGTACATGGTATATCTTACTCTGGTTGAGCCCATACTGAAGAGG 
CGCCTCTTTGGACATGCACAGTTGATACAGAGTGATGATGATATTGGGGATCACCAGCCTTT 
TGCAAATGCACACGATGTGCTAGCCCGCTCCCGCAGTCGAGCCAACGTGCTGAACAAGGTAG 
AATATGCACAGCAGCGCTGGAAGCTTCAAGTCCAAGAGCAGCGAAAGTCTGTCTTTGACCGG 
CATGTTGTCCTCAGCTAATTGGGAATTGAATTCAAGGTGACTAGAAAGAAACAGGCAGACAA 
CTGGAAAGAACTGACTGGGTTTTGCTGGGTTTCATTTTAATACCTTGTTGATTTCACCAACT 
GTTGCTGGAAGATTCAAAACTGGAAGCAAAAACTTGCTTGATTTTTTTTTCTTGTTAACGTA 
ATAATAGAGACATTTTTAAAAGCACACAGCTCAAAGTCAGCCAATAAGTCTTTTCCTATTTG 
TGACTTTTACTAATAAAAATAAATCTGCCTGTAAATTATCTTGAAGTCCTTTACCTGGAACA 
AGCACTCTCTTTTTCACCACATAGTTTTAACTTGACTTTCAAGATAATTTTCAGGGTTTTTG 
TTGTTGTTGTTTTTTGTTTGTTTGTTTTGGTGGGAGAGGGGAGGGATGCCTGGGAAGTGGTT 
AACAACTTTTTTCAAGTCACTTTACTAAACAAACTTTTGTAAATAGACCTTACCTTCTATTT 
TCGAGTTTCATTTATATTTTGCAGTGTAGCCAGCCTCATCAAAGAGCTGACTTACTCATTTG 
ACTTTTGCACTGACTGTATTATCTGGGTATCTGCTGTGTCTGCACTTCATGGTAAACGGGAT 
CTAAAATGCCTGGTGGCTTTTCACAAAAAGCAGATTTTCTTCATGTACTGTGATGTCTGATG 
CAATGCATCCTAGAACAAACTGGCCATTTGCTAGTTTACTCTAAAGACTAAACATAGTCTTG 
GTGTGTGTGGTCTTACTCATCTTCTAGTACCTTTAAGGACAAATCCTAAGGACTTGGACACT 
TGCAATAAAGAAATTTTATTTTAAACCCAAGCCTCCCTGGATTGATAATATATACACATTTG 
TCAGCATTTCCGGTCGTGGTGAGAGGCAGCTGTTTGAGCTCCAATATGTGCAGCTTTGAACT 
AGGGCTGGGGTTGTGGGTGCCTCTTCTGAAAGGTCTAACCATTATTGGATAACTGGCTTTTT 
TCTTCCTATGTCCTCTTTGGAATGTAACAATAAAAATAATTTTTGAAACATCAA 
FIGURE 300 

MATLWGGLLRLGSLLSLSCLALSVLLLAQLSDAAKNFEDVRCKCICPPYKENSGHIYNKNIS 
QKDCDCLHVVEPMPVRGPDVEAYCLRCECKYEERSSVTIKVTIIIYLSILGLLLLYMVYLTL
VEPILKRRLFGHAQLIQSDDDIGDHQPFANAHDVLARSRSRANVLNKVEYAQQRWKLQVQEQ 
RKSVFDRHVVLS
FIGURE 301 

GCACCTGCGACCACCGTGAGCAGTCATGGCGTACTCCACAGTGCAGAGAGTCGCTCTGGCTT
CTGGGCTTGTCCTGGCTCTGTCGCTGCTGCTGCCCAAGGCCTTCCTGTCCCGCGGGAAGCGG 
CAGGAGCCGCCGCCGACACCTGAAGGAAAATTGGGCCGATTTCCACCTATGATGCATCATCA 
CCAGGCACCCTCAGATGGCCAGACTCCTGGGGCTCGTTTCCAGAGGTCTCACCTTGCCGAGG 
CATTTGCAAAGGCCAAAGGATCAGGTGGAGGTGCTGGAGGAGGAGGTAGTGGAAGAGGTCTG 
ATGGGGCAGATTATTCCAATCTACGGTTTTGGGATTTTTTTATATATACTGTACATTCTATT 
TAAGGTAAGTAGAATCATCCTAATCATATTACATCAAT^AAATCTAATATGGCGATAAAAA 
TCATTGTCTACATTAAAACTTCTTATAGTTCATAAAATTATTTCAAATCCATCATCTCTTTA 
AATCCTGCCTCCTCTTCATGAGGTACTTAGGATAGCCATTATTTCAGTTTCACATAAGAATG 
TTTACTCAATGTTTAAGTGTTTTGCCCCAAAATTCACAACTAACAAGGCAGAACTAGGACTT 
GAACATGGATCTTTTGGTTCTTAATCCAGTGAGTGATACAATTCAATGCACTCCCCTGCCA 
FIGURE 302

MAYSTVQRVALASGLVLALSLLLPKAFLSRGKRQEPPPTPEGKLGRFPPMMHHHQAPSDGQT 
PGARFQRSHLAEAFAKAKGSGGGAGGGGSGRGLMGQIIPIYGFGIFLYILYILFKVSRIILI 
ILHQ 
FIGURE 303 

CGGCTCGAGTGCAGCT&TGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTT 
GGATTTGAAAGTTGAGAGCAGCATGTTTTGCCCACTGAAACTCATCCTGCTGCCAGTGTTAC 
TGGATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTG 
GGTGATTCAGCTCTGATGGGATGTGTTTTCCAGAGCACAGAAGACAAATGTATATTCAAGAT 
AGACTGGACTCTGTCACCAGGAGAGCACGCCAAGGACGAATATGTGCTATACTATTACTCCA 
ATCTCAGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGC 
AATGATGGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGA 
AATCCGCCTCAAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGCTTCCAG 
AGGAGCCCAAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAG 
AGCACAGAAGTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGA 
GGAGATTGTATTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCCAGAGCTGGG 
GCCACTTCCAGAATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATG 
CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAACCT 
GGTGTTCAAGAAAACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCC 
CGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTC 
TGTGCCACAATCCTGCTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAA 
GAGTTCAGTGAATTCTACAGTCTTGGTGAAGAACACGAAGAAGACTAATCCAGAGATAAAAG 
AAAAACCCTGCCATTTTGAAAGATGTGAAGGGGAGAAACACATTTACTCCCCAATAATTGTA 
CGGGAGGTGATCGAGGAAGAAGAACCAAGTGAAAAATCAGAGGCCACCTACATGACCATGCA 
CCCAGTTTGGCCTTCTCTGAGGTCAGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGG 
GAATGCCAAAAACACAGCAAGCCTTTTGAGAAGMTGGAGAGTCCCTTCATCTCAGCAGCGG 
TGGAGACTCTCTCCTGTGTGTGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTC 
CCAGCTGTCCTCCTGTCTCATTGTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGG 
CAGAGAGACTGGACAGCTCTGGAGGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGC 
CTCTGGAGTGGGACACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTT 
GGATCAGACCCTCCTGTGGGCAGGGTTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATA 
AAAACCAACCCAAATCAA 
FIGURE 304 

MFCPLKLILLPVLLDYSLGLNDLNVSPPELTVHVGDSALMGCVFQSTEDKCIFKIDWTLSPG 
EHAKDEYVLYYYSNLSVPIGRFQNRVHLMGDILCNDGSLLLQDVQEADQGTYICEIRLKGES 
QVFKKAVVLHVLPEEPKELMVHVGGLIQMGCVFQSTEVKHVTKVEWIFSGRRAKEEIVFRYY
HKLRMSVEYSQSWGHFQNRVNLVGDIFRNDGSIMLQGVRESDGGNYTCSIHLGNLVFKKTIV 
LHVSPEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTCGNKSSVNSTV 
LVKNTKKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEATYMTMHPVWPSLR 
SDRNNSLEKKSGGGMPKTQQAF 
FIGURE 305

CTATGAAGAAGCTTCCTGGAAAACAATAAGCAAAGGAAAACAAATGTGTCCCATCTCACATG 
GTTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGG 
AAACTACATTTTGCAAAGTCATTGAACTCTGAGCTCAGTTGCAGTACTCGGGAAGCCATGCA 
GGATGMGATGGATACATCACCTTAAATATTAAAACTCGGAAACCAGCTCTCGTCTCCGTTG 
GCCCTGCATCCTCCTCCTGGTGGCGTGTGATGGCTTTGATTCTGCTGATCCTGTGCGTGGGG 
ATGGTTGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCAATTACCTACAAGA 
TGAGAATGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGCGCTTCTGTCAATATGTGG 
TAAAACMTCAGAACTAAAGGGCACTTTCAAAGGTCATAAATGCAGCCCCTGTGACACAAAC 
TGGAGATATTATGGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAGAGAG 
TAAGCAGTACTGCACTGACATGAATGCTACTCTCCTGAAGATTGACAACCGGAACATTGTGG 
AGTACATCAAAGCCAGGACTCATTTAATTCGTTGGGTCGGATTATCTCGCCAGAAGTCGAAT 
GAGGTCTGGAAGTGGGAGGATGGCTCGGTTATCTCAGAAAATATGTTTGAGTTTTTGGAAGA 
TGGAAAAGGAAATATGAATTGTGCTTATTTTCATAATGGGAAAATGCACCCTACCTTCTGTG 
AGAACAAACATTATTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCT 
TAATGCAAAGAGGTGGACAGGATAACACAGATAAGGGCTTTATTGTACAATAAAAGATATGT 
ATGAATGCATCAGTAGCTGAAAAAAAAAAAAAA 
FIGURE 306

MQDEDGYITLNIKTRKPALVSVGPASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRNYL
QDENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWE
ESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISENMFEFL 
EDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP 
FIGURE 307

CCCACGCGTCCGCGCAGTCGCGCAGTTCTGCCTCCGCCTGCCAGTCTCGCCCGCGATCCCGG 
CCCGGGGCTGTGGCGTCGACTCCGACCCAGGCAGCCAGCAGCCCGCGCGGGAGCCGGACCGC 
CGCCGGAGGAGCTCGGACGGCATGCTGAGCCCCCTCCTTTGCTGAAGCCCGAGTGCGGAGAA 
GCCCGGGCAAACGCAGGCTAAGGAGACCAAAGCGGCGAAGTCGCGAGACAGCGGACAAGCAG 
CGGAGGAGAAGGAGGAGGAGGCGAACCCAGAGAGGGGCAGCAAAAGAAGCGGTGGTGGTGGG 
CGTCGTGGCCATGGCGGCGGCTATCGCCAGCTCGCTCATCCGTCAGAAGAGGCAAGCCCGCG 
AGCGCGAGAAATCCAACGCCTGCAAGTGTGTCAGCAGCCCCAGCAAAGGCAAGACCAGCTGC 
GACAAAAACAAGTTAAATGTCTTTTCCCGGGTCAAACTCTTCGGCTCCAAGAAGAGGCGCAG 
AAGAAGACCAGAGCCTCAGCTTAAGGGTATAGTTACCAAGCTATACAGCCGACAAGGCTACC 
ACTTGCAGCTGCAGGCGGATGGMCCATTGATGGCACCAAAGATGAGGACAGCACTTACACT 
CTGTTTAACCTCATCCCTGTGGGTCTGCGAGTGGTGGCTATCCAAGGAGTTCAAACCAAGCT 
GTACTTGGCAATGAACAGTGAGGGATACTTGTACACCTCGGAACTTTTCACACCTGAGTGCA 
AATTCAAAGAATCAGTGTTTGAAAATTATTATGTGACATATTCATCAATGATATACCGTCAG 
CAGCAGTCAGGCCGAGGGTGGTATCTGGGTCTGAACAAAGAAGGAGAGATCATGAAAGGCAA 
CCATGTGAAGAAGAACAAGCCTGCAGCTCATTTTCTGCCTAAACCACTGAAAGTGGCCATGT 
ACAAGGAGCCATCACTGCACGATCTCACGGAGTTCTCCCGATCTGGAAGCGGGACCCCAACC 
AAGAGCAGAAGTGTCTCTGGCGTGCTGAACGGAGGCAAATCCATGAGCCACAATGAATCAAC 
GTAGCCAGTGAGGGCAAAAGAAGGGCTCTGTAACAGMCCTTACCTCCAGGTGCTGTTGAAT 
TCTTCTAGCAGTCCTTCACCCAAAAGTTCAAATTTGTCAGTGACATTTACCAAACAAACAGG 
CAGAGTTCACTATTCTATCTGCCATTAGACCTTCTTATCATCCATACTAAAGC 



FIGURE 308 



></usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA28498
><subunit 1 of 1, 245 aa, 1 stop
><MW: 27564, pI: 10.18, NX(S/T): 1

MAAAIASSLIRQKRQAREREKSNACKCVSSPSKGKTSCDKNKLNVFSRVKLFGSKKRRRRRP 
EPQLKGIVTKLYSRQGYHLQLQADGTIDGTKDEDSTYTLFNLIPVGLRVVAIQGVQTKLYLA
MNSEGYLYTSELFTPECKFKESVFENYYVTYSSMIYRQQQSGRGWYLGLNKEGEIMKGNHVK 
KNKPAAHFLPKPLKVAMYKEPSLHDLTEFSRSGSGTPTKSRSVSGVLNGGKSMSHNEST 

N-glycosylation site.

amino acids 242-246 

Glycosaminoglycan attachment site.

amino acids 165-169, 218-222 

Tyrosine kinase phosphorylation site. 

amino acids 93-100 

N-myristoylation site. 

amino acids 87-93, 231-237 

ATP/GTP-binding site motif A (P-loop).

amino acids 231-239 



HBGF/FGF family proteins.

amino acids 78-94, 102-153 
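
The N-glycosylation annotation of Figure 308 can be reproduced with a simple motif scan. The following is a minimal Python sketch, not part of the original disclosure. It assumes the annotation follows the common PROSITE-style pattern N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline); the source does not name the tool that produced the annotation. `FIG_308` and `n_glyc_sites` are illustrative names, and the sequence is entered with the doubled valine at residues 49-50 that the Figure 307 nucleotide sequence encodes.

```python
import re

# Amino acid sequence of Figure 308 (245 residues).
FIG_308 = (
    "MAAAIASSLIRQKRQAREREKSNACKCVSSPSKGKTSCDKNKLNVFSRVKLFGSKKRRRRRP"
    "EPQLKGIVTKLYSRQGYHLQLQADGTIDGTKDEDSTYTLFNLIPVGLRVVAIQGVQTKLYLA"
    "MNSEGYLYTSELFTPECKFKESVFENYYVTYSSMIYRQQQSGRGWYLGLNKEGEIMKGNHVK"
    "KNKPAAHFLPKPLKVAMYKEPSLHDLTEFSRSGSGTPTKSRSVSGVLNGGKSMSHNEST"
)

# Assumed motif: N-{P}-[ST]-{P}. The lookahead lets overlapping sites be found.
N_GLYC = re.compile(r"(?=N[^P][ST][^P])")


def n_glyc_sites(protein):
    """Return 1-based start positions of candidate N-glycosylation sites."""
    return [match.start() + 1 for match in N_GLYC.finditer(protein)]


print(len(FIG_308), n_glyc_sites(FIG_308))
```

Under these assumptions the scan reports a single site starting at residue 242, which agrees with the "NX(S/T): 1" entry in the figure header. The figure lists the range as 242-246, that is, the start position through the start position plus four.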
FIGURE 309 

CCAGGATGGAGCTGGGGCCTGTATAGCCATATTATTGTTCTATGCTACTAGACATGGGGGGG 
ACTTGGTGAAAAAGGTATTATCCAGCCAGAGGGTCTGGGAGCCCTGTCTTACTGAACCTGGG 
CAACCTGGATATTCTGAGACATATTTTGGGGGGATTTCAGTGAAAAAAGTGGGGGATCCCCT 
CCATTTAGAGTGTAGCAAAGGAAAAAACACCAAGGTTGGGTTCCTTCCTGACATTGGCAGTG 
CCCCAGTAGGGGTGGGATGAGCGAATATTCCCAAAGCTAAAGTCCCACACCCTGTAGATTAC 
AAGAGTGGATTTGGCAGGAGTGTGCCCCAAAATACAGTGGAAAGGTGCCTGAAGATATTTAA 
ACCACGTCTTGGAAATTTAGTGGGTCTTGGCTTTGGGATAGGTGAAGTGAGGACAGACACTG 
GAGAGGAGGGAAAGGGGACGTTTTCAATAGGAGGCAAAACTCGAGGGTGGGATCCACTGAGG 
AGTACATAGGCTGCTGGATCTGGTGGAGCCAGCACTGGGCCCACGGGTGGTAACTGGCTGCT 
GTGGAGGGGGGTACGTGAGGGGGGGGTCTGGGGCTTATCCTCAGGTCCTGTGGGTGGGGCAG 
CGAGTCGGGGCCTGAGCGTCAAGAGCATGCCCTAGTGAGCGGGCTCCTCTGGGGGAGCCCAG 
CGCGCTCCGGGCGCCTGCCGGTTTGGGGGTGTCTCCTCCCGGGGCGCTATGGCGGCGCTGGC 
CAGTAGCCTGATCCGGCAGAAGCGGGAGGTCCGCGAGCCCGGGGGCAGCCGGCCGGTGTCGG 
CGCAGCGGCGCGTGTGTCCCCGCGGCACCAAGTCCCTTTGCCAGAAGCAGCTCCTCATCCTG 
CTGTCCAAGGTGCGACTGTGCGGGGGGCGGCCCGCGCGGCCGGACCGCGGCCCGGAGCCTCA 
GCTCAAAGGCATCGTCACCAAACTGTTCTGCCGCCAGGGTTTCTACCTCCAGGCGAATCCCG 
ACGGAAGCATCCAGGGCACCCCAGAGGATACCAGCTCCTTCACCCACTTCAACCTGATCCCT 
GTGGGCCTCCGTGTGGTCACCATCCAGAGCGCCAAGCTGGGTCACTACATGGCCATGAATGC 
TGAGGGACTGCTCTACAGTTCGCCGCATTTCACAGCTGAGTGTCGCTTTAAGGAGTGTGTCT 
TTGAGAATTACTACGTCCTGTACGCCTCTGCTCTCTACCGCCAGCGTCGTTCTGGCCGGGCC 
TGGTACCTCGGCCTGGACAAGGAGGGCCAGGTCATGAAGGGAAACCGAGTTAAGAAGACCAA 
GGCAGCTGCCCACTTTCTGCCCAAGCTCCTGGAGGTGGCCATGTACCAGGAGCCTTCTCTCC 
ACAGTGTCCCCGAGGCCTCCCCTTCCAGTCCCCCTGCCCCCTGAAATGTAGTCCCTGGACTG 
GAGGTTCCCTGCACTCCCAGTGAGCCAGCCACCACCACAACCTGT 



FIGURE 310 

MAALASSLIRQKREVREPGGSRPVSAQRRVCPRGTKSLCQKQLLILLSKVRLCGGRPARPDR 
GPEPQLKGIVTKLFCRQGFYLQANPDGSIQGTPEDTSSFTHFNLIPVGLRVVTIQSAKLGHY 
MAMNAEGLLYSSPHFTAECRFKECVFENYYVLYASALYRQRRSGRAWYLGLDKEGQVMKGNR 
VKKTKAAAHFLPKLLEVAMYQEPSLHSVPEASPSSPPAP 

Tyrosine kinase phosphorylation site: 

amino acids 199-207 

N-myristoylation sites:

amino acids 54-60, 89-95, 131-137 



HBGF/FGF family signature:

amino acids 131-155 

FIGURE 311 

ATGGCCGCGGCCATCGCTAGCGGCTTGATCCGCCAGAAGCGGCAGGCGCGGGAGCAGCACTG 
GGACCGGCCGTCTGCCAGCAGGAGGCGGAGCAGCCCCAGCAAGAACCGCGGGCTCTGCAACG 
GCAACCTGGTGGATATCTTCTCCAAAGTGCGCATCTTCGGCCTCAAGAAGCGCAGGTTGCGG 
CGCCAAGATCCCCAGCTCAAGGGTATAGTGACCAGGTTATATTGCAGGCAAGGCTACTACTT 
GCAAATGCACCCCGATGGAGCTCTCGATGGAACCAAGGATGACAGCACTAATTCTACACTCT 
TCAACCTCATACCAGTGGGACTACGTGTTGTTGCCATCCAGGGAGTGAAAACAGGGTTGTAT 
ATAGCCATGAATGGAGAAGGTTACCTCTACCCATCAGAACTTTTTACCCCTGAATGCAAGTT 
TAAAGAATCTGTTTTTGAAAATTATTATGTAATCTACTCATCCATGTTGTACAGACAACAGG 
AATCTGGTAGAGCCTGGTTTTTGGGATTAAATAAGGAAGGGCAAGCTATGAAAGGGAACAGA 
GTAAAGAAAACCAAACCAGCAGCTCATTTTCTACCCAAGCCATTGGAAGTTGCCATGTACCG 
AGAACCATCTTTGCATGATGTTGGGGAAACGGTCCCGAAGCCTGGGGTGACGCCAAGTAAAA 
GCACAAGTGCGTCTGCAATAATGAATGGAGGCAAACCAGTCAACAAGAGTAAGACAACATAG



FIGURE 312

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA28503
><subunit 1 of 1, 247 aa, 1 stop
><MW: 27702, pI: 10.36, NX(S/T): 2

MAAAIASGLIRQKRQAREQHWDRPSASRRRSSPSKNRGLCNGNLVDIFSKVRIFGLKKRRLR
RQDPQLKGIVTRLYCRQGYYLQMHPDGALDGTKDDSTNSTLFNLIPVGLRVVAIQGVKTGLY 
IAMNGEGYLYPSELFTPECKFKESVFENYYVIYSSMLYRQQESGRAWFLGLNKEGQAMKGNR
VKKTKPAAHFLPKPLEVAMYREPSLHDVGETVPKPGVTPSKSTSASAIMNGGKPVNKSKTT 

N-glycosylation site. 

amino acids 100-104, 242-246 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 28-32, 29-33 

Tyrosine kinase phosphorylation site.

amino acids 199-207 

N-myristoylation site. 

amino acids 38-44, 89-95, 118-124, 122-128, 222-228 
HBGF/FGF family proteins.

amino acids 104-155, 171-198 



FIGURE 313 



GGGGAGAGGAATTGACCATGTAAAAGGAGACTTTTTTTTTTGGTGGTGGTGGCTGTTGGGTGCCTTGCAAAAAT 
GAAGGATGCAGGACGCAGCTTTCTCCTGGAACCGAACGCAATGGATAAACTGATTGTGCAAGAGAGAAGGAAGA 
ACGAAGCTTTTTCTTGTGAGCCCTGGATCTTAACACAAATGTGTATATGTGCACACAGGGAGCATTCAAGAATG 
AAATAAACCAGAGTTAGACCCGCGGGGGTTGGTGTGTTCTGACATAAATAAATAATCTTT^GCAGCTGTTCCC 
CTCCCCACCCCCAAAAAAAAGGATGATTGGAAATGAAGAACCGAGGATTCACAAAGAAAAAAGTATGTTCATTT 
TTCTCTATAAAGGAGAAAGTGAGCCAAGGAGATATTTTTGGAATGAAAAGTTTGGGGCTTTTTTAGTAAAGTAA 
AGAACTGGTGTGGTGGTGTTTTCCTTTCTTTTTGAATTTCCCACAAGAGGAGAGGAAATTAATAATACATCTGC 
AAAGAAATTTCAGAGAAGAAAAGTTGACCGCGGCAGATTGAGGCATTGATTGGGGGAGAGAAACCAGCAGAGCA 
CAGTTGGATTTGTGCCTATGTTGACTAAAATTGACGGATAATTGCAGTTGGATTTTTCTTCATCAACCTCCTTT 
TTTTTAAATTTTTATTCCTTTTGGTATCAAGATCATGCGTTTTCTCTTGTTCTTAACCACCTGGATTTCCATCT 
GGATGTTGCTGTGATCAGTCTGAAATACAACTGTTTGAATTCCAGAAGGACCAACACCAGATAAATTATGAATG 
TTGAACAAGATGACCTTACATCCACAGCAGATAATGATAGGTCCTAGGTTTAACAGGGCCCTATTTGACCCCCT 
GCTTGTGGTGCTGCTGGCTCTTCAACTTCTTGTGGTGGCTGGTCTGGTGCGGGCTCAGACCTGCCCTTCTGTGT 
GCTCCTGCAGCAACCAGTTCAGCAAGGTGATTTGTGTTCGGAAAAACCTGCGTGAGGTTCCGGATGGCATCTCC 
ACCAACACACGGCTGCTGAACCTCCATGAGAACCAAATCCAGATCATCAAAGTGAACAGCTTCAAGCACTTGAG 
GCACTTGGAAATCCTACAGTTGAGTAGGAACCATATCAGAACCATTGAAATTGGGGCTTTCAATGGTCTGGCGA 
ACCTCAACACTCTGGAACTCTTTGACAATCGTCTTACTACCATCCCGAATGGAGCTTTTGTATACTTGTCTAAA 
CTGAAGGAGCTCTGGTTGCGAAACAACCCCATTGAAAGCATCCCTTCTTATGCTTTTAACAGAATTCCTTCTTT 
GCGCCGACTAGACTTAGGGGAATTGAAAAGACTTTCATACATCTCAGAAGGTGCCTTTGAAGGTCTGTCCAACT 
TGAGGTATTTGAACCTTGCCATGTGCAACCTTCGGGAAATCCCTAACCTCACACCGCTCATAAAACTAGATGAG 
CTGGATCTTTCTGGGAATCATTTATCTGCCATCAGGCCTGGCTCTTTCCAGGGTTTGATGCACCTTCAAAAACT 
GTGGATGATACAGTCCCAGATTCAAGTGATTGAACGGAATGCCTTTGACAACCTTCAGTCACTAGTGGAGATCA 
ACCTGGCACACAATAATCTAACATTACTGCCTCATGACCTCTTCACTCCCTTGCATCATCTAGAGCGGATACAT 
TTACATCACAACCCTTGGAACTGTAACTGTGACATACTGTGGCTCAGCTGGTGGATAAAAGACATGGCCCCCTC 
GAACACAGCTTGTTGTGCCCGGTGTAACACTCCTCCCAATCTAAAGGGGAGGTACATTGGAGAGCTCGACCAGA 
ATTACTTCACATGCTATGCTCCGGTGATTGTGGAGCCCCCTGCAGACCTCAATGTCACTGAAGGCATGGCAGCT 
GAGCTGAAATGTCGGGCCTCCACATCCCTGACATCTGTATCTTGGATTACTCCAAATGGAACAGTCATGACACA 
TGGGGCGTACAAAGTGCGGATAGCTGTGCTCAGTGATGGTACGTTAAATTTCACAAATGTAACTGTGCAAGATA 
CAGGCATGTACACATGTATGGTGAGTAATTCCGTTGGGAATACTACTGCTTCAGCCACCCTGAATGTTACTGCA 
GCAACCACTACTCCTTTCTCTTACTTTTCAACCGTCACAGTAGAGACTATGGAACCGTCTCAGGATGAGGCACG 
GACCACAGATAACAATGTGGGTCCCACTCCAGTGGTCGACTGGGAGACCACCAATGTGACCACCTCTCTCACAC 
CACAGAGCACAAGGTCGACAGAGAAAACCTTCACCATCCCAGTGACTGATATAAACAGTGGGATCCCAGGAATT 
GATGAGGTCATGAAGACTACCAAAATCATCATTGGGTGTTTTGTGGCCATCACACTCATGGCTGCAGTGATGCT 
GGTCATTTTCTACAAGATGAGGAAGCAGCACCATCGGCAAAACCATCACGCCCCAACAAGGACTGTTGAAATTA 
TTAATGTGGATGATGAGATTACGGGAGACACACCCATGGAAAGCCACCTGCCCATGCCTGCTATCGAGCATGAG 
CACCTAAATCACTATAACTCATACAAATCTCCCTTCAACCACACAACAACAGTTAACACAATAAATTCAATACA 
CAGTTCAGTGCATGAACCGTTATTGATCCGAATGAACTCTAAAGACAATGTACAAGAGACTCAAATCTAAAACA 
TTTACAGAGTTACAAAAAACAAACAATCAAAAAAAAAGACAGTTTATTAAAAATGACACAAATGACTGGGCTAA 
ATCTACTGTTTCAAAAAAGTGTCTTTACAAAAAAACAAAAT^GAAAAGAAATTTATTTATTAAAAATTCTATTG 
TGATCTAAAGCAGACAAAAA 
FIGURE 314 

MLNKMTLHPQQIMIGPRFNRALFDPLLVVLLALQLLVVAGLVRAQTCPSVCSCSNQFSKVIC
VRKNLREVPDGISTNTRLLNLHENQIQIIKVNSFKHLRHLEILQLSRNHIRTIEIGAFNGLA 
NLNTLELFDNRLTTIPNGAFVYLSKLKELWLRNNPIESIPSYAFNRIPSLRRLDLGELKRLS 
YISEGAFEGLSNLRYLNLAMCNLREIPNLTPLIKLDELDLSGNHLSAIRPGSFQGLMHLQKL 
WMIQSQIQVIERNAFDNLQSLVEINLAHNNLTLLPHDLFTPLHHLERIHLHHNPWNCNCDIL 
WLSWWIKDMAPSNTACCARCNTPPNLKGRYIGELDQNYFTCYAPVIVEPPADLNVTEGMAAE 
LKCRASTSLTSVSWITPNGTVMTHGAYKVRIAVLSDGTLNFTNVTVQDTGMYTCMVSNSVGN 
TTASATLNVTAATTTPFSYFSTVTVETMEPSQDEARTTDNNVGPTPVVDWETTNVTTSLTPQ 
STRSTEKTFTIPVTDINSGIPGIDEVMKTTKIIIGCFVAITLMAAVMLVIFYKMRKQHHRQN 
HHAPTRTVEIINVDDEITGDTPMESHLPMPAIEHEHLNHYNSYKSPFNHTTTVNTINSIHSS 
VHEPLLIRMNSKDNVQETQI 

Signal sequence: 

amino acids 1-44 

Transmembrane domain: 

amino acids 523-543 

N-glycosylation site.

amino acids 278-282, 364-368, 390-394, 412-416, 415-419, 
434-438, 442-446, 488-492, 606-610 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 183-187 

Casein kinase II phosphorylation site. 

amino acids 268-272, 417-421, 465-469, 579-583, 620-624

N-myristoylation site.

amino acids 40-46, 73-79, 118-124, 191-197, 228-234, 237-243, 
391-397, 422-428, 433-439, 531-537 
FIGURE 315

GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCACAT 
GGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGTCGGA 
GGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCGGGGATC 
GGG ATG TCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTGGGGACTCA 
CACTGAGATCAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCACCATCAACTGGGGC 
TTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAAAAA 
GTGGTGATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAGGAACAGAAGGGCCG 
AGTGGCCTTTGCTTCCAATTTCCTGGCAGGAGATGCCTCCTTGCAGATTGAACCTCTGAAGC 
CCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTGTGGAGCCAT 
GTCATCTTAAAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTTGGAAGGAGAGCTGAC 
AGAAGGAAGTGACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAGCCCATTGTGTATT 
ACTGGCAGCGAATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCTCCCAAATCTAGGATT 
GACTACAACCACCCTGGACGAGTTCTGCTGCAGAATCTTACCATGTCCTACTCTGGACTGTA 
CCAGTGCACAGCAGGCAACGAAGCTGGGAAGGAAAGCTGTGTGGTGCGAGTAACTGTACAGT 
ATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGCATAGTGGCTGGAGCCCTGCTG 
ATTTTCCTCTTGGTGTGGCTGCTAATCCGAAGGAAAGACAAAGAAAGATATGAGGAAGAAGA 
GAGACCTAATGAAATTCGAGAAGATGCTGAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCT 
CCTCTTCCTCAGGCTCTCGGAGCTCACGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAAT 
AGTGCCTCACGCAGCCAGCGGACACTGTCAACTGACGCAGCACCCCAGCCAGGGCTGGCCAC 
CCAGGCATACAGCCTAGTGGGGCCAGAGGTGAGAGGTTCTGAACCAAAGAAAGTCCACCATG 
CTAATCTGACCAAAGCAGAAACCACACCCAGCATGATCCCCAGCCAGAGCAGAGCCTTCCAA 
ACGGTC TGA ATTACAATGGACTTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTC 
TTCTCGTCATTGGAGCTCAAGTCACCAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCA 
GTGAGCATTGCACGGAACAGATTCAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAA 
AGGATGTAAGCTGATTCATCTGTAAAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGG 
AAAGCAGGAGTCCAAATCTATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTG 
AGGTGAATATACCTAAAACTTTTAATGTGGGATATTTTGTATCAGTGCTTTGATTCACAATT 
TTCAAGAGGAAATGGGATGCTGTTTGTAAATTTTCTATGCATTTCTGCAAACTTATTGGATT 
ATTAGTTATTCAGACAGTCAAGCAGAACCCACAGCCTTATTACACCTGTCTACACCATGTAC 
TGAGCTAACCACTTCTAAGAAACTCCAAAAAAGGAAACATGTGTCTTCTATTCTGACTTAAC 
TTCATTTGTCATAAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATAGTGGGAGATGGAGA 
AGAGTGAATGAGTTTCTCCCACTCTATACTAATCTCACTATTTGTATTGAGCCCAAAATAAC 
TATGAAAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTTCATGATGTT 
ATGAGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCCCCTCAAAT 
CAGATGCCTCTAAGGACTTTCCTGCTAGATATTTCTGGAAGGAGAAAATACAACATGTCATT 
TATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAAAAGGGATCTAGGAATGCTGAAAGATTA 
CCCAACATACCATTATAGTCTCTTCTTTCTGAGAAAATGTGAAACCAGAATTGCAAGACTGG 
GTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTAATATGTCAAGGAAGGTAGCCGGGCA 
TGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCAGTGAGCCGAGATTATGCC 
ATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 
FIGURE 316

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45419
><subunit 1 of 1, 373 aa, 1 stop
><MW: 41281, pI: 8.33, NX(S/T): 3

MSLLLLLLLVSYYVGTLGTHTEIKRVAEEKVTLPCHHQLGLPEKDTLDIEWLLTDNEGNQKV 
VITYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYVWSHV 
ILKVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLPPKSRID 
YNHPGRVLLQNLTMSYSGLYQCTAGNEAGKESCVVRVTVQYVQSIGMVAGAVTGIVAGALLI 
FLLVWLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSGSSSTRSTANS 
ASRSQRTLSTDAAPQPGLATQAYSLVGPEVRGSEPKKVHHANLTKAETTPSMIPSQSRAFQTV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 232-251 
FIGURE 317



CGCGAGGCGCGGGGAGCCTGGGACCAGGAGCGAGAGCCGCCTACCTGCAGCCGCCGCCCACGGCACGGCAGCCA 
CCATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGGATTTCGCCAGAAGTTTGAGTATCACTACT
CCTGAAGAGATGATTGAAAAAGCCAAAGGGGAAACTGCCTATCTGCCATGCAAATTTACGCTTAGTCCCGAAGA 
CCAGGGACCGCTGGACATCGAGTGGCTGATATCACCAGCTGATAATCAGAAGGTGGATCAAGTGATTATTTTAT 
ATTCTGGAGACAAAATTTATGATGACTACTATCCAGATCTGAAAGGCCGAGTACATTTTACGAGTAATGATCTC 
AAATCTGGTGATGCATCAATAAATGTAACGAATTTACAACTGTCAGATATTGGCACATATCAGTGCAAAGTGAA 
AAAAGCTCCTGGTGTTGCAAATAAGAAGATTCATCTGGTAGTTCTTGTTAAGCCTTCAGGTGCGAGATGTTACG 
TTGATGGATCTGAAGAAATTGGAAGTGACTTTAAGATAAAATGTGAACCAAAAGAAGGTTCACTTCCATTACAG 
TATGAGTGGCAAAAATTGTCTGACTCACAGAAAATGCCCACTTCATGGTTAGCAGAAATGACTTCATCTGTTAT 
ATCTGTAAAAAATGCCTCTTCTGAGTACTCTGGGACATACAGCTGTACAGTCAGAAACAGAGTGGGCTCTGATC 
AGTGCCTGTTGCGTCTAAACGTTGTCCCTCCTTCAAATAAAGCTGGACTAATTGCAGGAGCCATTATAGGAACT 
TTGCTTGCTCTAGCGCTCATTGGTCTTATCATCTTTTGCTGTCGTAAAAAGCGCAGAGAAGAAAAATATGAAAA 
GGAAGTTCATCACGATATCAGGGAAGATGTGCCACCTCCAAAGAGCCGTACGTCCACTGCCAGAAGCTACATCG 
GCAGTAATCATTCATCCCTGGGGTCCATGTCTCCTTCCAACATGGAAGGATATTCCAAGACTCAGTATAACCAA 
GTACCAAGTGAAGACTTTGAACGCACTCCTCAGAGTCCGACTCTCCCACCTGCTAAGTTCAAGTACCCTTACAA 
GACTGATGGAATTACAGTTGTATAAATATGGACTACTGAAGAATCTGAAGTATTGTATTATTTGACTTTATTTT 
AGGCCTCTAGTAAAGACTTAAATGTTTTTTAAAAAAAGCACAAGGCACAGAGATTAGAGCAGCTGTAAGAACAC 
ATCTACTTTATGCAATGGCATTAGACATGTAAGTCAGATGTCATGTCAAAATTAGTACGAGCCAAATTCTTTGT 
TAAAAAACCCTATGTATAGTGACACTGATAGTTAAAAGATGTTTTATTATATTTTCAATAACTACCACTAACAA 
ATTTTTAACTTTTCATATGCATATTCTGATATGTGGTCTTTTAGGATU^GTATGGTTAATAGTTGATTTTTCAA 
AGGAAATTTTAAAATTCTTACGTTCTGTTTAATGTTTTTGCTATTTAGTTAAATACATTGAAGGGAAATACCCG 
TTCTTTTCCCCTTTTATGCACACAACAGAAACACGCGTTGTCATGCCTCAAACTATTTTTTATTTGCAACTACA 
TGATTTCACACAATTCTCTTAAACAACGACATAAAATAGATTTCCTTGTATATAAATAACTTACATACGCTCCA 
TAAAGTAAATTCTCAAAGGTGCTAGAACAAATCGTCCACTTCTACAGTGTTCTCGTATCCAACAGAGTTGATGC 
ACAATATATAAATACTCAAGTCCAATATTAAAAACTTAGGCACTTGACTAACTTTAATAAAATTTCTCAAACTA 
TATCAATATCTAAAGTGCATATATTTTTTAAGATAGATTATTCTCAATAACTTCTATAAAAATAAGTTTGATGG
TTTGGCCCATCTAACTTCACTACTATTAGTAAGAACTTTTAACTTTTAATGTGTAGTAAGGTTTATTCTACCTT 
TTTCTCAACATGACACCAACACAATCAAAAACGAAGTTAGTGAGGTGCTAACATGTGAGGATTAATCCAGTGAT 
TCCGGTCACAATGCATTCCAGGAGGAGGTACCCATGTCACTGGAATTGGGCGATATGGTTTATTTTTTCTTCCC 
TGATTTGGATAACCAAATGGAACAGGAGGAGGATAGTGATTCTGATGGCCATTCCCTCGATACATTCCTGGCTT 
TTTTCTGGGCAAAGGGTGCCACATTGGAAGAGGTGGAAATATAAGTTCTGAAATCTGTAGGGAAGAGAACACAT 
TAAGTTAATTCAAAGGAAAAAATCATCATCTATGTTCCAGATTTCTCATTAAAGACAAAGTTACCCACAACACT 
GAGATCACATCTAAGTGACACTCCTATTGTCAGGTCTAAATACATTAAAAACCTCATGTGTAATAGGCGTATAA 
TGTATAACAGGTGAGCAATGTTTTCTGAATGCATAAAGAAATGAATAAACTCAAACACAGTACTTCCTAAACAA
CTTCAACCAAAAAAGACCAAAACATGGAACGAATGGAAGCTTGTAAGGACATGCTTGTTTTAGTCCAGTGGTTT 
CCACAGCTGGCTAAGCCAGGAGTCACTTGGAGGCTTTTAAATACAAAACATTGGAGCTGGAGGCCATTATCCTT 
AGCAAACTAATGCAGAAACAGAAAATCAACTACCGCATGTTCTCACTTATAAGTGGGAGGTAATGATAAGAACT 
TATGAACACAAAGAAGGAAACAATAGACATTGGAGTCTATTTGAGAGGGGAGGGTGGGAGAAGGAAAAGGAGCA 
GAAAAGATAACTATTGAGTACTGCCTTCACACCTGGGTGATGAAATAATATGTACAACAAATCCCTGTGACACA 
TGTTTACCTATGGAACAAACCTTCATGTGTATCCCTAAACCTAAAATAAAAGTTAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 318 

></usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA82361 
><subunit 1 of 1, 352 aa, 1 stop
><MW: 38938, pI: 7.86, NX(S/T): 3

MALLLCFVLLCGVVDFARSLSITTPEEMIEKAKGETAYLPCKFTLSPEDQGPLDIEWLISPA 
DNQKVDQVIILYSGDKIYDDYYPDLKGRVHFTSNDLKSGDASINVTNLQLSDIGTYQCKVKK 
APGVANKKIHLVVLVKPSGARCYVDGSEEIGSDFKIKCEPKEGSLPLQYEWQKLSDSQKMPT
SWLAEMTSSVISVKNASSEYSGTYSCTVRNRVGSDQCLLRLNVVPPSNKAGLIAGAIIGTLL 
ALALIGLIIFCCRKKRREEKYEKEVHHDIREDVPPPKSRTSTARSYIGSNHSSLGSMSPSNM 
EGYSKTQYNQVPSEDFERTPQSPTLPPAKFKYPYKTDGITVV

Signal sequence. 

amino acids 1-19 

Transmembrane domain: 

amino acids 236-257 

N-glycosylation sites.

amino acids 106-110, 201-205, 298-302 

Tyrosine kinase phosphorylation sites. 

amino acids 31-39, 78-85, 262-270 

N-myristoylation sites.

amino acids 116-122, 208-214, 219-225, 237-243, 241-247, 
245-251, 296-302 



Myelin P0 protein.

amino acids 96-125 
FIGURE 319

TGAAATGACTTCCACGGCTGGGACGGGAACCTTCCACCCACAGCTATGCCTCTGATTGGTGA
ATGGTGAAGGTGCCTGTCTAACTTTTCTGTAAAAAGAACCAGCTGCCTCCAGGCAGCCAGCC 
CTCAAGCATCACTTACAGGACCAGAGGGACAAGACATGACTGTGATGAGGAGCTGCTTTCGC 
CAATTTAACACCAAGAAGAATTGAGGCTGCTTGGGAGGAAGGCCAGGAGGAACACGAGACTG 
AGAGATGAATTTTCAACAGAGGCTGCAAAGCCTGTGGACTTTAGCCAGACCCTTCTGCCCTC 
CTTTGCTGGCGACAGCCTCTCAAATGCAGATGGTTGTGCTCCCTTGCCTGGGTTTTACCCTG 
CTTCTCTGGAGCCAGGTATCAGGGGCCCAGGGCCAAGAATTCCACTTTGGGCCCTGCCAAGT 
GAAGGGGGTTGTTCCCCAGAAACTGTGGGAAGCCTTCTGGGCTGTGAAAGACACTATGCAAG 
CTCAGGATAACATCACGAGTGCCCGGCTGCTGCAGCAGGAGGTTCTGCAGAACGTCTCGGAT 
GCTGAGAGCTGTTACCTTGTCCACACCCTGCTGGAGTTCTACTTGAAAACTGTTTTCAAAAA
CCACCACAATAGAACAGTTGAAGTCAGGACTCTGAAGTCATTCTCTACTCTGGCCAACAACT 
TTGTTCTCATCGTGTCACAACTGCAACCCAGTCAAGAAAATGAGATGTTTTCCATCAGAGAC 
AGTGCACACAGGCGGTTTCTGCTATTCCGGAGAGCATTCAAACAGTTGGACGTAGAAGCAGC 
TCTGACCAAAGCCCTTGGGGAAGTGGACATTCTTCTGACCTGGATGCAGAAATTCTACAAGC
TCTGAATGTCTAGACCAGGACCTCCCTCCCCCTGGCACTGGTTTGTTCCCTGTGTCATTTCA 
AACAGTCTCCCTTCCTATGCTGTTCACTGGACACTTCACGCCCTTGGCCATGGGTCCCATTC 
TTGGCCCAGGATTATTGTCAAAGAAGTCATTCTTTAAGCAGCGCCAGTGACAGTCAGGGAAG 
GTGCCTCTGGATGCTGTGAAGAGTCTACAGAGAAGATTCTTGTATTTATTACAACTCTATTT 
AATTAATGTCAGTATTTCAACTGAAGTTCTATTTATTTGTGAGACTGTAAGTTACATGAAGG 
CAGCAGAATATTGTGCCCCATGCTTCTTTACCCCTCACAATCCTTGCCACAGTGTGGGGCAG 
TGGATGGGTGCTTAGTAAGTACTTAATAAACTGTGGTGCTTTTTTTGGCCTGTCTTTGGATT 
GTTAAAAAACAGAGAGGGATGCTTGGATGTAAAACTGAACTTCAGAGCATGAAAATCACACT 
GTCTTCTGATATCTGCAGGGACAGAGCATTGGGGTGGGGGTAAGGTGCATCTGTTTGAAAAG 
TAAACGATAAAATGTGGATTAAAGTGCCCAGCACAAAGCAGATCCTCAATAAACATTTCATT 
TCCCACCCACACTCGCCAGCTCACCCCATCATCCCTTTCCCTTGGTGCCCTCCTTTTTTTTT 
TATCCTAGTCATTCTTCCCTAATCTTCCACTTGAGTGTCAAGCTGACCTTGCTGATGGTGAC 
ATTGCACCTGGATGTACTATCCAATCTGTGATGACATTCCCTGCTAATAAAAGACAACATAA 
CTCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 320

></usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA88002
><subunit 1 of 1, 206 aa, 1 stop
><MW: 23799, pI: 9.12, NX(S/T): 3

MNFQQRLQSLWTLARPFCPPLLATASQMQMVVLPCLGFTLLLWSQVSGAQGQEFHFGPCQVK
GVVPQKLWEAFWAVKDTMQAQDNITSARLLQQEVLQNVSDAESCYLVHTLLEFYLKTVFKNH
HNRTVEVRTLKSFSTLANNFVLIVSQLQPSQENEMFSIRDSAHRRFLLFRRAFKQLDVEAAL 
TKALGEVDILLTWMQKFYKL 

Signal sequence: 

amino acids 1-42 

N-glycosylation sites.

amino acids 85-89, 99-103, 126-130 
FIGURE 321

AAGGAGCAGCCCGCAAGCACCAAGTGAGAGGCATGAAGTTACAGTGTGTTTCCCTTTGGCTC
CTGGGTACAATACTGATATTGTGCTCAGTAGACAACCACGGTCTCAGGAGATGTCTGATTTC 
CACAGACATGCACCATATAGAAGAGAGTTTCCAAGAAATCAAAAGAGCCATCCAAGCTAAGG 
ACACCTTCCCAAATGTCACTATCCTGTCCACATTGGAGACTCTGCAGATCATTAAGCCCTTA 
GATGTGTGCTGCGTGACCAAGAACCTCCTGGCGTTCTACGTGGACAGGGTGTTCAAGGATCA 
TCAGGAGCCAAACCCCAAAATCTTGAGAAAAATCAGCAGCATTGCCAACTCTTTCCTCTACA 
TGCAGAAAACTCTGCGGCAATGTCAGGAACAGAGGCAGTGTCACTGCAGGCAGGAAGCCACC 
AATGCCACCAGAGTCATCCATGACAACTATGATCAGCTGGAGGTCCACGCTGCTGCCATTAA 
ATCCCTGGGAGAGCTCGACGTCTTTCTAGCCTGGATTAATAAGAATCATGAAGTAATGTTCT 
CAGCTTGATGACAAGGAACCTGTATAGTGATCCAGGGATGAACACCCCCTGTGCGGTTTACT 
GTGGGAGACAGCCCACCTTGAAGGGGAAGGAGATGGGGAAGGCCCCTTGCAGCTGAAAGTCC 
CACTGGCTGGCCTCAGGCTGTCTTATTCCGCTTGAAAATAGGCAAAAAGTCTACTGTGGTAT 
TTGTAATAAACTCTATCTGCTGAAAGGGCCTGCAGGCCATCCTGGGAGTAAAGGGCTGCCTT 
CCCATCTAATTTATTGTAAAGTCATATAGTCCATGTCTGTGATGTGAGCCAAGTGATATCCT 
GTAGTACACATTGTACTGAGTGGTTTTTCTGAATAAATTCCATATTTTACCTATGA 

FIGURE 322 



></usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA92282
><subunit 1 of 1, 177 aa, 1 stop
><MW: 20452, pI: 8.00, NX(S/T): 2

MKLQCVSLWLLGTILILCSVDNHGLRRCLISTDMHHIEESFQEIKRAIQAKDTFPNVTILST 
LETLQIIKPLDVCCVTKNLLAFYVDRVFKDHQEPNPKILRKISSIANSFLYMQKTLRQCQEQ 
RQCHCRQEATNATRVIHDNYDQLEVHAAAIKSLGELDVFLAWINKNHEVMFSA 

Signal sequence: 

amino acids 1-18 

N-glycosylation sites. 

amino acids 56-60, 135-139 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 102-106 

N-myristoylation site. 

amino acids 24-30 



Actinin-type actin-binding domain signature 1. 

amino acids 159-169 
FIGURE 323

CCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACTTGGCTTCGTTAG 
AACGCGGCTACAATTAATACATAACCTTATGTATCATACACATACGATTTAGGTGACACTAT 
AGAATAACATCCACTTTGCCTTTCTCTCCACAGGTGTCCACTCCCAGGTCCAACTGCACCTC 
GGTTCTATCGATAATCTCAGCACCAGCCACTCAGAGCAGGGCACGATGTTGGGGGCCCGCCT 
CAGGCTCTGGGTCTGTGCCTTGTGCAGCGTCTGCAGCATGAGCGTCCTCAGAGCCTATCCCA 
ATGCCTCCCCACTGCTCGGCTCCAGCTGGGGTGGCCTGATCCACCTGTACACAGCCACAGCC 
AGGAACAGCTACCACCTGCAGATCCACAAGAATGGCCATGTGGATGGCGCACCCCATCAGAC 
CATCTACAGTGCCCTGATGATCAGATCAGAGGATGCTGGCTTTGTGGTGATTACAGGTGTGA 
TGAGCAGAAGATACCTCTGCATGGATTTCAGAGGCAACATTTTTGGATCACACTATTTCGAC 
CCGGAGAACTGCAGGTTCCAACACCAGACGCTGGAAAACGGGTACGACGTCTACCACTCTCC 
TCAGTATCACTTCCTGGTCAGTCTGGGCCGGGCGAAGAGAGCCTTCCTGCCAGGCATGAACC 
CACCCCCGTACTCCCAGTTCCTGTCCCGGAGGAACGAGATCCCCCTAATTCACTTCAACACC 
CCCATACCACGGCGGCACACCCGGAGCGCCGAGGACGACTCGGAGCGGGACCCCCTGAACGT 
GCTGAAGCCCCGGGCCCGGATGACCCCGGCCCCGGCCTCCTGTTCACAGGAGCTCCCGAGCG 
CCGAGGACAACAGCCCGATGGCCAGTGACCCATTAGGGGTGGTCAGGGGCGGTCGAGTGAAC 
ACGCACGCTGGGGGAACGGGCCCGGAAGGCTGCCGCCCCTTCGCCAAGTTCATCTAGGGTCG 
CTGG 
FIGURE 324

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA142238
><subunit 1 of 1, 251 aa, 1 stop
><MW: 27954, pI: 9.22, NX(S/T): 1

MLGARLRLWVCALCSVCSMSVLRAYPNASPLLGSSWGGLIHLYTATARNSYHLQIHKNGHVD 
GAPHQTIYSALMIRSEDAGFVVITGVMSRRYLCMDFRGNIFGSHYFDPENCRFQHQTLENGY
DVYHSPQYHFLVSLGRAKRAFLPGMNPPPYSQFLSRRNEIPLIHFNTPIPRRHTRSAEDDSE 
RDPLNVLKPRARMTPAPASCSQELPSAEDNSPMASDPLGVVRGGRVNTHAGGTGPEGCRPFA 
KFI 

Important features of the protein:
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 175-179 

N-myristoylation site. 

amino acids 33-39, 100-106, 225-231, 229-235 

HBGF/FGF family proteins

amino acids 73-124 
FIGURE 325

GGAAAAGGTACCCGCGAGAGACAGCCAGCAGTTCTGTGGAGCAGCGGTGGCCGGCTAGGATG
GGCTGTCTCTGGGGTCTGGCTCTGCCCCTTTTCTTCTTCTGCTGGGAGGTTGGGGTCTCTGG 
GAGCTCTGCAGGCCCCAGCACCCGCAGAGCAGACACTGCGATGACAACGGACGACACAGAAG 
TGCCCGCTATGACTCTAGCACCGGGCCACGCCGCTCTGGAAACTCAAACGCTGAGCGCTGAG 
ACCTCTTCTAGGGCCTCAACCCCAGCCGGCCCCATTCCAGAAGCAGAGACCAGGGGAGCCAA 
GAGAATTTCCCCTGCAAGAGAGACCAGGAGTTTCACAAAAACATCTCCCAACTTCATGGTGC 
TGATCGCCACCTCCGTGGAGACATCAGCCGCCAGTGGCAGCCCCGAGGGAGCTGGAATGACC 
ACAGTTCAGACCATCACAGGCAGTGATCCCGAGGAAGCCATCTTTGACACCCTTTGCACCGA 
TGACAGCTCTGAAGAGGCAAAGACACTCACAATGGACATATTGACATTGGCTCACACCTCCA 
CAGAAGCTAAGGGCCTGTCCTCAGAGAGCAGTGCCTCTTCCGACGGCCCCCATCCAGTCATC 
ACCCCGTCACGGGCCTCAGAGAGCAGCGCCTCTTCCGACGGCCCCCATCCAGTCATCACCCC 
GTCACGGGCCTCAGAGAGCAGCGCCTCTTCCGACGGCCCCCATCCAGTCATCACCCCGTCAT 
GGTCCCCGGGATCTGATGTCACTCTCCTCGCTGAAGCCCTGGTGACTGTCACAAACATCGAG 
GTTATTAATTGCAGCATCACAGAAATAGAAACAACAACTTCCAGCATCCCTGGGGCCTCAGA 
CATAGATCTCATCCCCACGGAAGGGGTGAAGGCCTCGTCCACCTCCGATCCACCAGCTCTGC 
CTGACTCCACTGAAGCAAAACCACACATCACTGAGGTCACAGCCTCTGCCGAGACCCTGTCC 
ACAGCCGGCACCACAGAGTCAGCTGCACCTCATGCCACGGTTGGGACCCCACTCCCCACTAA 
CAGCGCCACAGAAAGAGAAGTGACAGCACCCGGGGCCACGACCCTCAGTGGAGCTCTGGTCA 
CAGTTAGCAGGAATCCCCTGGAAGAAACCTCAGCCCTCTCTGTTGAGACACCAAGTTACGTC 
AAAGTCTCAGGAGCAGCTCCGGTCTCCATAGAGGCTGGGTCAGCAGTGGGCAAAACAACTTC 
CTTTGCTGGGAGCTCTGCTTCCTCCTACAGCCCCTCGGAAGCCGCCCTCAAGAACTTCACCC 
CTTCAGAGACACCGACCATGGACATCGCAACCAAGGGGCCCTTCCCCACCAGCAGGGACCCT 
CTTCCTTCTGTCCCTCCGACTACAACCAACAGCAGCCGAGGGACGAACAGCACCTTAGCCAA 
GATCACAACCTCAGCGAAGACCACGATGAAGCCCCAACAGCCACGCCCACGACTGCCCGGAC 
GAGGCCGACCACAGACGTGAGTGCAGGTGAAAATGGAGGTTTCCTCCTCCTGCGGCTGAGTG 
TGGCTTCCCCGGAAGACCTCACTGACCCCAGAGTGGCAGAAAGGCTGATGCAGCAGCTCCAC 
CGGGAACTCCACGCCCACGCGCCTCACTTCCAGGTCTCCTTACTGCGTGTCAGGAGAGGCTA 
ACGGACATCAGCTGCAGCCAGGCATGTCCCGTATGCCAAAAGAGGGTGCTGCCCCTAGCCTG 
GGCCCCCACCGACAGACTGCAGCTGCGTTACTGTGCTGAGAGGTACCCAGAAGGTTCCCATG 
AAGGGCAGCATGTCCAAGCCCCTAACCCCAGATGTGGCAACAGGACCCTCGCTCACATCCAC 
CGGAGTGTATGTATGGGGAGGGGCTTCACCTGTTCCCAGAGGTGTCCTTGGACTCACCTTGG 
CACATGTTCTGTGTTTCAGTAAAGAGAGACCTGATCACCCATCTGTGTGCTTCCATCCTGCA 
TTAAAATTCACTCAGTGTGGCCCAAAAAAAA 
FIGURE 326

MGCLWGLALPLFFFCWEVGVSGSSAGPSTRRADTAMTTDDTEVPAMTLAPGHAALETQTLSA
ETSSRASTPAGPIPEAETRGAKRISPARETRSFTKTSPNFMVLIATSVETSAASGSPEGAGM 
TTVQTITGSDPEEAIFDTLCTDDSSEEAKTLTMDILTLAHTSTEAKGLSSESSASSDGPHPV 
ITPSRASESSASSDGPHPVITPSRASESSASSDGPHPVITPSWSPGSDVTLLAEALVTVTNI 
EVINCSITEIETTTSSIPGASDIDLIPTEGVKASSTSDPPALPDSTEAKPHITEVTASAETL 
STAGTTESAAPHATVGTPLPTNSATEREVTAPGATTLSGALVTVSRNPLEETSALSVETPSY 
VKVSGAAPVSIEAGSAVGKTTSFAGSSASSYSPSEAALKNFTPSETPTMDIATKGPFPTSRD 
PLPSVPPTTTNSSRGTNSTLAKITTSAKTTMKPQQPRPRLPGRGRPQT

N-glycosylation sites:

amino acids 252-256, 445-449, 451-455 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 84-90 

Casein kinase II phosphorylation sites. 

amino acids 37-41, 108-112, 131-135, 133-137, 148-152, 165-169, 
246-250, 254-258, 256-260, 269-273, 283-287, 333-337, 335-339, 
404-408, 414-418, 431-435 

N-myristoylation sites.

amino acids 2-8, 19-25, 117-123, 121-127, 232-238, 278-284,
314-320, 349-355, 386-392, 397-403, 449-455

ATP/GTP-binding site motif A (P-loop).

amino acids 385-393 
FIGURE 327

GCGGAGCATCCGCTGCGGTCCTCGCCGAGACCCCCGCGCGGATTCGCCGGTCCTTCCCGCGG 
GCGCGACAGAGCTGTCCTCGCACCTGGATGGCAGCAGGGGCGCCGGGGTCCTCTCGACGCCA 
GAGAGAAATCTCATCATCTGTGCAGCCTTCTTAAAGCAAACTAAGACCAGAGGGAGGATTAT 
CCTTGACCTTTGAAGACCAAAACTAAACTGAAATTTAATATGTTCTTCGGGGGAGAAGGGAG
CTTGACTTACACTTTGGTAATAATTTGCTTCCTGACACTAAGGCTGTCTGCTAGTCAGAATT 
GCCTCAAAAAGAGTCTAGAAGATGTTGTCATTGACATCCAGTCATCTCTTTCTAAGGGAATC 
AGAGGCAATGAGCCCGTATATACTTCAACTCAAGAAGACTGCATTAATTCTTGCTGTTCAAC 
AAAAAACATATCAGGGGACAAAGCATGTAACTTGATGATCTTCGACACTCGAAAAACAGCTA 
GACAACCCAACTGCTACCTATTTTTCTGTCCCAACGAGGAAGCCTGTCCATTGAAACCAGCA 
AAAGGACTTATGAGTTACAGGATAATTACAGATTTTCCATCTTTGACCAGAAATTTGCCAAG 
CCAAGAGTTACCCCAGGAAGATTCTCTCTTACATGGCCAATTTTCACAAGCAGTCACTCCCC 
TAGCCCATCATCACACAGATTATTCAAAGCCCACCGATATCTCATGGAGAGACACACTTTCT
CAGAAGTTTGGATCCTCAGATCACCTGGAGAAACTATTTAAGATGGATGAAGCAAGTGCCCA 
GCTCCTTGCTTATAAGGAAAAAGGCCATTCTCAGAGTTCACAATTTTCCTCTGATCAAGAAA 
TAGCTCATCTGCTGCCTGAAAATGTGAGTGCGCTCCCAGCTACGGTGGCAGTTGCTTCTCCA 
CATACCACCTCGGCTACTCCAAAGCCCGCCACCCTTCTACCCACCAATGCTTCAGTGACACC 
TTCTGGGACTTCCCAGCCACAGCTGGCCACCACAGCTCCACCTGTAACCACTGTCACTTCTC 
AGCCTCCCACGACCCTCATTTCTACAGTTTTTACACGGGCTGCGGCTACACTCCAAGCAATG 
GCTACAACAGCAGTTCTGACTACCACCTTTCAGGCACCTACGGACTCGAAAGGCAGCTTAGA 
AACCATACCGTTTACAGAAATCTCCAACTTAACTTTGAACACAGGGAATGTGTATAACCCTA 
CTGCACTTTCTATGTCAAATGTGGAGTCTTCCACTATGAATAAAACTGCTTCCTGGGAAGGT 
AGGGAGGCCAGTCCAGGCAGTTCCTCCCAGGGCAGTGTTCCAGAAAATCAGTACGGCCTTCC 
ATTTGAAAAATGGCTTCTTATCGGGTCCCTGCTCTTTGGTGTCCTGTTCCTGGTGATAGGCC 
TCGTCCTCCTGGGTAGAATCCTTTCGGAATCACTCCGCAGGAAACGTTACTCAAGACTGGAT 
TATTTGATCAATGGGATCTATGTGGACATCTAAGGATGGAACTCGGTGTCTCTTAATTCATT
TAGTAACCAGAAGCCCAAATGCAATGAGTTTCTGCTGACTTGCTAGTCTTAGCAGGAGGTTG 
TATTTTGAAGACAGGAAAATGCCCCCTTCTGCTTTCCTTTTTTTTTTTGGAGACAGAGTCTT 
GCTCTGTTGCCCAGGCTGGAGTGCAGTAGCACGATCTCGGCTCTCACCGCAACCTCCGTCTC 
CTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTAAGTATCTGGGATTACAGGCATGTGCCA
CCACACCTGGGTGATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGTCAGGCTG 
GTCTCAAACTCCTGACCTAGTGATCCACCCTCCTCGGCCTCCCAAAGTGCTGGGATTACAGG 
CATGAGCCACCACAGCTGGCCCCCTTCTGTTTTATGTTTGGTTTTTGAGAAGGAATGAAGTG 
GGAACCAAATTAGGTAATTTTGGGTAATCTGTCTCTAAAATATTAGCTAAAAACAAAGCTCT 
ATGTAAAGTAATAAAGTATAATTGCCATATAAATTTCAAAATTCAACTGGCTTTTATGCAAA 
GAAACAGGTTAGGACATCTAGGTTCCAATTCATTCACATTCTTGGTTCCAGATAAAATCAAC 
TGTTTATATCAATTTCTAATGGATTTGCTTTTCTTTTTATATGGATTCCTTTAAAACTTATT 
CCAGATGTAGTTCCTTCCAATTAAATATTTGAATAAATCTTTTGTTACTCAA 
FIGURE 328

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45410
><subunit 1 of 1, 431 aa, 1 stop
><MW: 46810, pI: 6.45, NX(S/T): 6

MFFGGEGSLTYTLVIICFLTLRLSASQNCLKKSLEDVVIDIQSSLSKGIRGNEPVYTSTQED 
CINSCCSTKNISGDKACNLMIFDTRKTARQPNCYLFFCPNEEACPLKPAKGLMSYRIITDFP 
SLTRNLPSQELPQEDSLLHGQFSQAVTPLAHHHTDYSKPTDISWRDTLSQKFGSSDHLEKLF 
KMDEASAQLLAYKEKGHSQSSQFSSDQEIAHLLPENVSALPATVAVASPHTTSATPKPATLL 
PTNASVTPSGTSQPQLATTAPPVTTVTSQPPTTLISTVFTRAAATLQAMATTAVLTTTFQAP 
TDSKGSLETIPFTEISNLTLNTGNVYNPTALSMSNVESSTMNKTASWEGREASPGSSSQGSV 
PENQYGLPFEKWLLIGSLLFGVLFLVIGLVLLGRILSESLRRKRYSRLDYLINGIYVDI 

Signal sequence.

amino acids 1-25

Transmembrane domain.

amino acids 384-405

N-glycosylation sites.

amino acids 72-76, 222-226, 251-255, 327-331, 352-356

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 415-419 

Tyrosine kinase phosphorylation site. 

amino acids 50-57 

N-myristoylation sites. 

amino acids 4-10, 48-54, 315-321 
FIGURE 329

CTCCCACGGTGTCCAGCGCCCAGAATGCGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTGCT
CCCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACTGTGT 
CCCTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGGAAGGGT 
GGGATCCTCTTCTCTCGCTGCTCTGGCACCATCTATGCAGAAGAAGAAGGCCAGGAGACAAT 
GAAGGGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTGACCCTGTGGA 
ACCTCACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGGGGCCCCGATGAG 
TCTTTACTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCTGTCCTCCCTCCCCTTCTCCCAC 
CTTCCAGCCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCTCAGCAAACCCAGCCCC 
CAGGATTGACTTCTCCTGGGCTCTACCCGGCAGCCACCACAGCCAAGCAGGGGAAGACAGGG 
GCTGAGGCCCCTCCATTGCCAGGGACTTCCCAGTACGGGCACGAAAGGACTTCTCAGTACAC 
AGGAACCTCTCCTCACCCAGCGACCTCTCCTCCTGCAGGGAGCTCCCGCCCCCCCATGCAGC 
TGGACTCCACCTCAGCAGAGGACACCAGTCCAGCTCTCAGCAGTGGCAGCTCTAAGCCCAGG 
GTGTCCATCCCGATGGTCCGCATACTGGCCCCAGTCCTGGTGCTGCTGAGCCTTCTGTCAGC 
CGCAGGCCTGATCGCCTTCTGCAGCCACCTGCTCCTGTGGAGAAAGGAAGCTCAACAGGCCA 
CGGAGACACAGAGGAACGAGAAGTTCTGGCTCTCACGCTTGACTGCGGAGGAAAAGGAAGCC 
CCTTCCCAGGCCCCTGAGGGGGACGTGATCTCGATGCCTCCCCTCCACACATCTGAGGAGGA 
GCTGGGCTTCTCGAAGTTTGTCTCAGCGTAGGGCAGGAGGCCCTCCTGGCCAGGCCAGCAGT 
GAAGCAGTATGGCTGGCTGGATCAGCACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAG 
TCCAGCTGCCCGGACTCCAGGGCTCTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCA 
GCCTGACCTAGAAGCGTTTGTCAGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTG 
GAGACTGGGACATCCCTGATAGGTTCACATCCCTGGGCAGAGTACCAGGCTGCTGACCCTCA 
GCAGGGCCAGACAAGGCTCAGTGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGC 
CTCATGCCCAGTGTCGGACCCTGCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCC 
TGGCGTCCTCAGACTTAGTCCCACGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCT 
GGGGTGAGACTGGGATTCTGGCTTCTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCT 
GTGAAAAACGTGATTCCTGGCCCCACCAAGACCCACCAAAACCATCTCTGGGCTTGGTGCAG 
GACTCTGAATTCTAACAATGCCCAGTGACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTG 
ATGAACGCTCACACCCCTTCAGCTTAGAGTCTGCATTTGGGCTGTGACGTCTCCACCTGCCC 
CAATAGATCTGCTCTGTCTGCGACACCAGATCCACGTGGGGACTCCCCTGAGGCCTGCTAAG 
TCCAGGCCTTGGTCAGGTCAGGTGCACATTGCAGGATAAGCCCAGGACCGGCACAGAAGTGG 
TTGCCTTTNCCATTTGCCCTCCCTGGNCCATGCCTTCTTGCCTTTGGAAAAAATGATGAAGA 
AAACCTTGGCTCCTTCCTTGTCTGGAAAGGGTTACTTGCCTATGGGTTCTGGTGGCTAGAGA 
GAAAAGTAGAAAACCAGAGTGCACGTAGGTGTCTAACACAGAGGAGAGTAGGAACAGGGCGG 
ATACCTGAAGGTGACTCCGAGTCCAGCCCCCTGGAGAAGGGGTCGGGGGTGGTGGTAAAGTA 
GCACAACTACTATTTTTTTTCTTTTTCCATTATTATTGTTTTTTAAGACAGAATCTCGTGCT 
GCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCAAACTCCGCCTCCTGGGTTCAAGTGATT 
CTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCACCACACCTGGCTAATT 
TTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGAC 
CTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCGGGATTACAGGCATGAGCCACTGTG 
TCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTGTTCAGTATGCAAAACTTGGAAAG 
ATGGAGGAGAAAAAGAAAAGGAAGAAAAAAATGTCACCCATAGTCTCACCAGAGACTATCAT 
TATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTCTTCACATAATTTGCCGGTGTTCTT 
TTTACAGAGCAATTATCTTGTATATACAACTTTGTATCCTGCCTTTTCCACCTTATCGTTCC 
ATCACTTTATTCCAGCACTTCTCTGTGTTTTACAGACCTTTTTATAAATAAAATGTTCATCA 
GCTGCATAAAAAAAAAAAAAA 
FIGURE 330

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44196
><subunit 1 of 1, 332 aa, 1 stop
><MW: 36143, pI: 5.89, NX(S/T): 1

MRLLVLLWGCLLLPGYEALEGPEEISGFEGDTVSLQCTYREELRDHRKYWCRKGGILFSRCS 
GTIYAEEEGQETMKGRVSIRDSRQELSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLISLFV 
FPGPCCPPSPSPTFQPLATTRLQPKAKAQQTQPPGLTSPGLYPAATTAKQGKTGAEAPPLPG
TSQYGHERTSQYTGTSPHPATSPPAGSSRPPMQLDSTSAEDTSPALSSGSSKPRVSIPMVRI 
LAPVLVLLSLLSAAGLIAFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEKEAPSQAPEGD 
VISMPPLHTSEEELGFSKFVSA 

Important features:
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 248-269 

N-glycosylation site. 

amino acids 96-99 

Fibrinogen beta and gamma chains C-terminal domain.

amino acids 104-113 

Ig like V-type domain: 

amino acids 13-128 
This International Searching Authority found multiple (groups of)
inventions in this international application, as follows:

1. Claims: Invention 1: claims 1-28, all partially

Isolated nucleic acid having at least 80% sequence homology
to a polynucleotide sequence encoding PRO281 (seq.ID.2),
vector comprising said nucleic acid, host cell comprising
said vector, method of producing said protein using said
host, isolated protein having at least 80% homology to
PRO281, chimeric protein comprising a portion corresponding
to PRO281, antibody against said protein, the isolated
extracellular domain of said protein or a protein with at
least 80% homology thereto, and said isolated protein
lacking its signal peptide or a protein with at least 80%
homology thereto.



2. Claims: Inventions 2-132: claims 1-28, 

all partially and claims 4 and 13 as far as 
applicable 



Subject matter as defined for invention 1 above, but limited
to the respective amino acid sequences and corresponding
nucleic acid sequences referred to as:

2. PRO276 (Seq.ID's 5/6),
3. PRO189 (Seq.ID's 7/8),
4. PRO190 (Seq.ID's 13/14),
5. PRO341 (Seq.ID's 19/20),
6. PRO180 (Seq.ID's 22/23),
7. PRO190 (Seq.ID's 13/14),
8. PRO194 (Seq.ID's 27/28),
9. PRO203 (Seq.ID's 29/30),
10. PRO290 (Seq.ID's 32/33),
11. PRO874 (Seq.ID's 35/36),
12. PRO710 (Seq.ID's 40/41),
13. PRO1151 (Seq.ID's 46/47),
14. PRO1281 (Seq.ID's 51/52),
15. PRO358 (Seq.ID's 56/57),
16. PRO1310 (Seq.ID's 61/62),
17. PRO698 (Seq.ID's 66/67),
18. PRO732 (Seq.ID's 72/73),
19. PRO1120 (Seq.ID's 83/84),
20. PRO537 (Seq.ID's 94/95),
21. PRO536 (Seq.ID's 96/97),
22. PRO535 (Seq.ID's 98/99),
23. PRO718 (Seq.ID's 102/103),
24. PRO872 (Seq.ID's 112/113),
25. PRO1063 (Seq.ID's 114/115),
26. PRO619 (Seq.ID's 116/117),
27. PRO1188 (Seq.ID's 123/124),
28. PRO784 (Seq.ID's 134/135),
29. PRO783 (Seq.ID's 137/138),
30. PRO820 (Seq.ID's 145/146),
31. PRO1080 (Seq.ID's 147/148),
32. PRO1079 (Seq.ID's 150/151),
33. PRO793 (Seq.ID's 152/153),
34. PRO1016 (Seq.ID's 155/156),
35. PRO1013 (Seq.ID's 157/158),
36. PRO937 (Seq.ID's 159/160),
37. PRO842 (Seq.ID's 164/165),
38. PRO839 (Seq.ID's 166/167),
39. PRO1180 (Seq.ID's 168/169),
40. PRO1134 (Seq.ID's 170/171),
41. PRO830 (Seq.ID's 174/175),
42. PRO1115 (Seq.ID's 176/177),
43. PRO1277 (Seq.ID's 178/179),
44. PRO1135 (Seq.ID's 180/181),
45. PRO828 (Seq.ID's 188/189),
46. PRO1009 (Seq.ID's 193/194),
47. PRO1007 (Seq.ID's 196/197),
48. PRO1056 (Seq.ID's 198/199),
49. PRO826 (Seq.ID's 200/201),
50. PRO819 (Seq.ID's 202/203),
51. PRO1006 (Seq.ID's 204/205),
52. PRO1112 (Seq.ID's 206/207),
53. PRO1074 (Seq.ID's 208/209),
54. PRO1005 (Seq.ID's 210/211),
55. PRO1073 (Seq.ID's 212/213),
56. PRO1152 (Seq.ID's 215/216),
57. PRO1136 (Seq.ID's 218/219),
58. PRO813 (Seq.ID's 220/221),
59. PRO809 (Seq.ID's 222/223),
60. PRO791 (Seq.ID's 224/225),
61. PRO1004 (Seq.ID's 226/227),
62. PRO1111 (Seq.ID's 228/229),
63. PRO1344 (Seq.ID's 230/231),
64. PRO1109 (Seq.ID's 235/236),
65. PRO1383 (Seq.ID's 240/241),
66. PRO1003 (Seq.ID's 245/246),
67. PRO1108 (Seq.ID's 247/248),
68. PRO1137 (Seq.ID's 249/250),
69. PRO1138 (Seq.ID's 252/253),
70. PRO1054 (Seq.ID's 255/256),
71. PRO994 (Seq.ID's 257/258),



3. Claim : 1 



72. PRO812 (Seq.ID's 259/260),
73. PRO1069 (Seq.ID's 261/262),
74. PRO1129 (Seq.ID's 263/264),
75. PRO1068 (Seq.ID's 265/266),
76. PRO1066 (Seq.ID's 267/268),
77. PRO1184 (Seq.ID's 269/270),
78. PRO1360 (Seq.ID's 271/272),
79. PRO1029 (Seq.ID's 273/274),
80. PRO1139 (Seq.ID's 275/276),
81. PRO1309 (Seq.ID's 277/278),
82. PRO1028 (Seq.ID's 280/281),
83. PRO1027 (Seq.ID's 282/283),
84. PRO1107 (Seq.ID's 284/285),
85. PRO1140 (Seq.ID's 286/287),
86. PRO1105 (Seq.ID's 288/289),
87. PRO1291 (Seq.ID's 290/291),
88. PRO1105 (Seq.ID's 292/293),
89. PRO511 (Seq.ID's 294/295),
90. PRO1104 (Seq.ID's 296/297),
91. PRO1100 (Seq.ID's 298/299),
92. PRO836 (Seq.ID's 300/301),
93. PRO1141 (Seq.ID's 302/303),
94. PRO1132 (Seq.ID's 308/309),
95. PRO1346 (Seq.ID's 313/314),
96. PRO1131 (Seq.ID's 318/319),
97. PRO1281 (Seq.ID's 325/326),
98. PRO1064 (Seq.ID's 333/334),
99. PRO1379 (Seq.ID's 339/340),
100. PRO844 (Seq.ID's 344/345),
101. PRO848 (Seq.ID's 346/347),
102. PRO1097 (Seq.ID's 348/349),
103. PRO1153 (Seq.ID's 350/351),
104. PRO1154 (Seq.ID's 352/353),
105. PRO1182 (Seq.ID's 356/357),
106. PRO1155 (Seq.ID's 358/359),
107. PRO1156 (Seq.ID's 360/361),
108. PRO1098 (Seq.ID's 362/363),
109. PRO1127 (Seq.ID's 364/365),
110. PRO1126 (Seq.ID's 366/367),
111. PRO1125 (Seq.ID's 368/369),
112. PRO1186 (Seq.ID's 370/371),
113. PRO1198 (Seq.ID's 372/373),
114. PRO1158 (Seq.ID's 374/375),
115. PRO1159 (Seq.ID's 376/377),
116. PRO1124 (Seq.ID's 378/379),
117. PRO1287 (Seq.ID's 380/381),
118. PRO1312 (Seq.ID's 386/387),
119. PRO1192 (Seq.ID's 388/389),
120. PRO1160 (Seq.ID's 393/394),
121. PRO1187 (Seq.ID's 398/399),
122. PRO1185 (Seq.ID's 400/401),
123. PRO1345 (Seq.ID's 402/403),
124. PRO1245 (Seq.ID's 407/408),
125. PRO1358 (Seq.ID's 409/410),
126. PRO1195 (Seq.ID's 411/412),
127. PRO1270 (Seq.ID's 413/414),
128. PRO1271 (Seq.ID's 415/416),
129. PRO1375 (Seq.ID's 417/418),
130. PRO1385 (Seq.ID's 419/420),
131. PRO1384 (Seq.ID's 423/424),
132. PRO9828 (Seq.ID's 510/511).



For the sake of conciseness, the first subject matter is 
explicitly defined, the other subject matters are defined by 
analogy thereto. 
4. Claims: Invention 133: claims 1-36,69-74,99-102,

all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology
to a polynucleotide sequence encoding PRO943 (seq.ID.118),
vector comprising said nucleic acid, host cell comprising
said vector, method of producing said protein using said
host, isolated protein having at least 80% homology to
PRO943, chimeric protein comprising a portion corresponding
to PRO943, antibody against said protein, the isolated
extracellular domain of said protein or a protein with at
least 80% homology thereto, and said isolated protein
lacking its signal peptide or a protein with at least 80%
homology thereto. Method for detecting PRO183, PRO184 or
PRO185 through interaction with PRO943 and vice versa,
method of linking a bioactive molecule to a cell expressing
PRO943 using PRO183, PRO184 or PRO185 as binding ligands and
vice versa, and method of modulating the activity of PRO943
by binding with PRO183, PRO184 or PRO185, and vice versa.



5. Claims: Inventions 134-136: claims 1-36,69-74,99-102, 
all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology
to a polynucleotide sequence encoding PRO183 (seq.ID.495),
PRO184 (seq.ID.497) or PRO185 (seq.ID.499), vector
comprising said nucleic acid, host cell comprising said
vector, method of producing said protein using said host,
isolated protein having at least 80% homology to said
protein, chimeric protein comprising a portion corresponding
to said protein, antibody against said protein, the isolated
extracellular domain of said protein or a protein with at
least 80% homology thereto, and said isolated protein
lacking its signal peptide or a protein with at least 80%
homology thereto. Method for detecting PRO183, PRO184 or
PRO185 through interaction with PRO943 and vice versa,
method of linking a bioactive molecule to a cell expressing
PRO943 using PRO183, PRO184 or PRO185 as binding ligands and
vice versa, and method of modulating the activity of PRO943
by binding with PRO183, PRO184 or PRO185, and vice versa,
whereby invention 134 is limited to PRO183, invention 135 is
limited to PRO184, and invention 136 is limited to PRO185.



6. Claims: Invention 137: claims 1-28,37-44,75-80,103-106, 
all partially and claims 4 and 13 as far as 
applicable 
Isolated nucleic acid having at least 80% sequence homology
to a polynucleotide sequence encoding PRO331 (seq.ID.501),
vector comprising said nucleic acid, host cell comprising
said vector, method of producing said protein using said
host, isolated protein having at least 80% homology to
PRO331, chimeric protein comprising a portion corresponding
to PRO331, antibody against said protein, the isolated
extracellular domain of said protein or a protein with at
least 80% homology thereto, and said isolated protein
lacking its signal peptide or a protein with at least 80%
homology thereto. Method for detecting PRO1133 through
interaction with PRO331 and vice versa, method of linking a
bioactive molecule to a cell expressing PRO331 using PRO1133
as binding ligands and vice versa, and method of modulating
the activity of PRO331 by binding with PRO1133, and vice
versa.



7. Claims: Invention 138: claims 1-28,37-44,75-80,103-106, 
all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology
to a polynucleotide sequence encoding PRO1133 (seq.ID.129),
vector comprising said nucleic acid, host cell comprising
said vector, method of producing said protein using said
host, isolated protein having at least 80% homology to said
protein, chimeric protein comprising a portion corresponding
to said protein, antibody against said protein, the isolated
extracellular domain of said protein or a protein with at
least 80% homology thereto, and said isolated protein
lacking its signal peptide or a protein with at least 80%
homology thereto. Method for detecting PRO1133 through
interaction with PRO331 and vice versa, method of linking a
bioactive molecule to a cell expressing PRO331 using PRO1133
as binding ligands and vice versa, and method of modulating
the activity of PRO331 by binding with PRO1133, and vice
versa.



8. Claims: Invention 139: claims 1-28,45-52,81-86,107-110, 
all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology 
to a polynucleotide sequence encoding PR01387 (seq.ID.422) , 
vector comprising said nucleic acid, host cell comprising 
said vector, method of producing said protein using said 
host, isolated protein having at least 80% homology to 
PR01387, chimeric protein comprising a portion corresponding 
to PR01387, antibody against said protein, the isolated 
extracellular domain of said protein or a protein with at 
least 80% homology thereto, and said isolated protein 
lacking its signal peptide or a protein with at least 80% 
homology thereto. Method for detecting PRO363 or PRO5723 
through interaction with PRO1387 and vice versa, method of 
linking a bioactive molecule to a cell expressing PRO1387 
using PRO363 or PRO5723 as binding ligands and vice versa, 
and method of modulating the activity of PRO1387 by binding 
with PRO363 or PRO5723, and vice versa. 



9. Claims: Inventions 140 and 141: claims 1-28,45-52,81-86, 
107-110, 
all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology 
to a polynucleotide sequence encoding PRO363 (seq.ID.503) or 
PRO5723 (seq.ID.505), vector comprising said nucleic acid, 
host cell comprising said vector, method of producing said 
protein using said host, isolated protein having at least 
80% homology to said protein, chimeric protein comprising a 
portion corresponding to said protein, antibody against said 
protein, the isolated extracellular domain of said protein 
or a protein with at least 80% homology thereto, and said 
isolated protein lacking its signal peptide or a protein 
with at least 80% homology thereto. Method for detecting 
PRO363 or PRO5723 through interaction with PRO1387 and vice 
versa, method of linking a bioactive molecule to a cell 
expressing PRO1387 using PRO363 or PRO5723 as binding 
ligands and vice versa, and method of modulating the 
activity of PRO1387 by binding with PRO363 or PRO5723, and 
vice versa, whereby invention 140 is limited to PRO363 and 
invention 141 is limited to PRO5723. 



10. Claims: Invention 142: claims 1-28,53-60,87-92,111-114, 
all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology 
to a polynucleotide sequence encoding PRO1114 (seq.ID.183), 
vector comprising said nucleic acid, host cell comprising 
said vector, method of producing said protein using said 
host, isolated protein having at least 80% homology to 
PRO1114, chimeric protein comprising a portion corresponding 
to PRO1114, antibody against said protein, the isolated 
extracellular domain of said protein or a protein with at 
least 80% homology thereto, and said isolated protein 
lacking its signal peptide or a protein with at least 80% 
homology thereto. Method for detecting PRO3301 or PRO9940 
through interaction with PRO1114 and vice versa, method of 
linking a bioactive molecule to a cell expressing PRO1114 
using PRO3301 or PRO9940 as binding ligands and vice versa, 
and method of modulating the activity of PRO1114 by binding 
with PRO3301 or PRO9940, and vice versa. 



11. Claims: Inventions 143 and 144: claims 1-28,53-60,87-92, 
111-114, 
all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology 
to a polynucleotide sequence encoding PRO3301 (seq.ID.507) 
or PRO9940 (seq.ID.509), vector comprising said nucleic 
acid, host cell comprising said vector, method of producing 
said protein using said host, isolated protein having at 
least 80% homology to said protein, chimeric protein 
comprising a portion corresponding to said protein, antibody 
against said protein, the isolated extracellular domain of 
said protein or a protein with at least 80% homology 
thereto, and said isolated protein lacking its signal 
peptide or a protein with at least 80% homology thereto. 
Method for detecting PRO3301 or PRO9940 through interaction 
with PRO1114 and vice versa, method of linking a bioactive 
molecule to a cell expressing PRO1114 using PRO3301 or 
PRO9940 as binding ligands and vice versa, and method of 
modulating the activity of PRO1114 by binding with PRO3301 
or PRO9940, and vice versa, whereby invention 143 is limited 
to PRO3301 and invention 144 is limited to PRO9940. 



12. Claims: Invention 145: claims 1-28,61-69,93-98,115-118, 
all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology 
to a polynucleotide sequence encoding PRO1181 (seq.ID.355), 
vector comprising said nucleic acid, host cell comprising 
said vector, method of producing said protein using said 
host, isolated protein having at least 80% homology to 
PRO1181, chimeric protein comprising a portion corresponding 
to PRO1181, antibody against said protein, the isolated 
extracellular domain of said protein or a protein with at 
least 80% homology thereto, and said isolated protein 
lacking its signal peptide or a protein with at least 80% 
homology thereto. Method for detecting PRO7170, PRO361 or 
PRO846 through interaction with PRO1181 and vice versa, 
method of linking a bioactive molecule to a cell expressing 
PRO1181 using PRO7170, PRO361 or PRO846 as binding ligands 
and vice versa, and method of modulating the activity of 
PRO1181 by binding with PRO7170, PRO361 or PRO846, and vice 
versa. 



13. Claims: Inventions 146-148: claims 1-28,61-69,93-98, 
115-118, 
all partially and claims 4 and 13 as far as 
applicable 



Isolated nucleic acid having at least 80% sequence homology 
to a polynucleotide sequence encoding PRO7170 (seq.ID.513), 
PRO361 (seq.ID.515) or PRO846 (seq.ID.517), vector 
comprising said nucleic acid, host cell comprising said 
vector, method of producing said protein using said host, 
isolated protein having at least 80% homology to said 
protein, chimeric protein comprising a portion corresponding 
to said protein, antibody against said protein, the isolated 
extracellular domain of said protein or a protein with at 
least 80% homology thereto, and said isolated protein 
lacking its signal peptide or a protein with at least 80% 
homology thereto. Method for detecting PRO7170, PRO361 or 
PRO846 through interaction with PRO1181 and vice versa, 
method of linking a bioactive molecule to a cell expressing 
PRO1181 using PRO7170, PRO361 or PRO846 as binding ligands 
and vice versa, and method of modulating the activity of 
PRO1181 by binding with PRO7170, PRO361 or PRO846, and vice 
versa, whereby invention 146 is limited to PRO7170, 
invention 147 is limited to PRO361, and invention 148 is 
limited to PRO846. 